This application claims priority to prior filed U.S. Provisional Patent Application
Serial No. 60/142764, filed July 8, 1999, and U.S. Provisional Patent Application Serial
No. 60/152,318, filed September 3, 1999. The entire contents of both of the
aforementioned applications are hereby expressly incorporated herein by reference.

Background of the Invention

Certain products and by-products of naturally-occurring metabolic processes in 
cells have utility in a wide array of industries, including the food, feed, cosmetics, and
pharmaceutical industries. These molecules, collectively termed 'fine chemicals', 
include organic acids, both proteinogenic and non-proteinogenic amino acids, 
nucleotides and nucleosides, lipids and fatty acids, diols, carbohydrates, aromatic
compounds, vitamins and cofactors, and enzymes. Their production is most 
conveniently performed through the large-scale culture of bacteria developed to produce 
and secrete large quantities of one or more desired molecules. One particularly useful 
organism for this purpose is Corynebacterium glutamicum, a gram positive, 
nonpathogenic bacterium. Through strain selection, a number of mutant strains have
been developed which produce an array of desirable compounds. However, selection of 
strains improved for the production of a particular molecule is a time-consuming and 
difficult process. 

Summary of the Invention

The invention provides novel bacterial nucleic acid molecules which have a 
variety of uses. These uses include the identification of microorganisms which can be 
used to produce fine chemicals, the modulation of fine chemical production in C. 
glutamicum or related bacteria, the typing or identification of C. glutamicum or related 
bacteria, as reference points for mapping the C. glutamicum genome, and as markers for
transformation. These novel nucleic acid molecules encode proteins, referred to herein 
as marker and fine chemical production (MCP) proteins. 

C. glutamicum is a gram positive, aerobic bacterium which is commonly used in
industry for the large-scale production of a variety of fine chemicals, and also for the
degradation of hydrocarbons (such as in petroleum spills) and for the oxidation of
terpenoids. The MCP nucleic acid molecules of the invention, therefore, can be used to
identify microorganisms which can be used to produce fine chemicals, e.g., by
fermentation processes. Modulation of the expression of the MCP nucleic acids of the
invention, or modification of the sequence of the MCP nucleic acid molecules of the
invention, can be used to modulate the production of one or more fine chemicals from a
microorganism (e.g., from a Corynebacterium or Brevibacterium species).

The MCP nucleic acids of the invention may also be used to identify an
organism as being Corynebacterium glutamicum or a close relative thereof, or to
identify the presence of C. glutamicum or a relative thereof in a mixed population of
microorganisms. The invention provides the nucleic acid sequences of a number of C.
glutamicum genes; by probing the extracted genomic DNA of a culture of a unique or
mixed population of microorganisms with a probe spanning a
region of a C. glutamicum gene which is unique to this organism, one can ascertain
whether this organism is present. Although Corynebacterium glutamicum itself is
nonpathogenic, it is related to species pathogenic in humans, such as Corynebacterium
diphtheriae (the causative agent of diphtheria); the detection of such organisms is of
significant clinical relevance.
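
As a rough in silico analogue of the probing step described above, the following Python sketch checks whether a probe sequence, or its reverse complement, occurs in a DNA string. It is an illustration only: exact string matching stands in for hybridization, the function names are hypothetical, and the sequences are invented for the example and are not taken from Appendix A.

```python
# Illustrative sketch: exact matching is a crude stand-in for hybridization.
# All names and sequences here are hypothetical, not from the specification.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def probe_detected(genomic_dna, probe):
    """Return True if the probe or its reverse complement occurs in the DNA."""
    dna = genomic_dna.upper()
    probe = probe.upper()
    return probe in dna or reverse_complement(probe) in dna


# Invented example data: a short "genomic" string containing the probe.
sample = "TTGACCGATTACGGCATCCGTA"
unique_probe = "GATTACGGCATC"

print(probe_detected(sample, unique_probe))                      # True
print(probe_detected(sample, reverse_complement(unique_probe)))  # True
print(probe_detected(sample, "AAAAAAAAAAAA"))                    # False
```

Both strands are checked because a probe may anneal to either strand of the extracted DNA. A realistic screen would tolerate mismatches and account for stringency, which this sketch does not attempt.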

The MCP nucleic acid molecules of the invention may also serve as reference
points for mapping of the C. glutamicum genome, or of genomes of related organisms.
Similarly, these molecules, or variants or portions thereof, may serve as markers for
genetically engineered Corynebacterium or Brevibacterium species.

The MCP proteins of the invention
may be involved, for example, in the direct or indirect production of one or more fine
chemicals from C. glutamicum. The MCP proteins of the invention may also participate
in the degradation of hydrocarbons or the oxidation of terpenoids. These proteins may
also be utilized for the identification of Corynebacterium glutamicum or organisms
related to C. glutamicum; the presence of an MCP protein specific to C. glutamicum and
related species in a mixture of proteins may indicate the presence of one of these
bacteria in the sample. Further, these MCP proteins may have homologues in plants or
animals which are involved in a disease state or condition; these proteins thus may serve
as useful pharmaceutical targets for drug screening and the development of therapeutic
compounds.

Given the availability of cloning vectors for use in Corynebacterium
glutamicum, such as those disclosed in Sinskey et al., U.S. Patent No. 4,649,119, and
techniques for genetic manipulation of C. glutamicum and the related Brevibacterium
species (e.g., lactofermentum) (Yoshihama et al., J. Bacteriol. 162: 591-597 (1985);
Katsumata et al., J. Bacteriol. 159: 306-311 (1984); and Santamaria et al.), the
nucleic acid molecules of the invention may be
utilized in the genetic engineering of this organism to modulate the production of one or
more fine chemicals. This modulation may be due to a direct effect of manipulation of a
gene of the invention, or it may be due to an indirect effect of such manipulation. For
example, by modifying the activity of a protein involved in the biosynthesis or
degradation of a fine chemical (i.e., through mutagenesis of the corresponding gene),
one may directly modulate the ability of the cell to synthesize or to degrade this
compound, thereby modulating the yield and/or efficiency of production of the fine
chemical. Similarly, by modulating the activity of a protein which regulates a fine
chemical metabolic pathway, one may directly influence whether the production of the
desired compound is up- or down-regulated, either of which will modulate the yield or
efficiency of production of the fine chemical from the cell.

Indirect modulation of fine chemical production may also result by modifying 
the activity of a protein of the invention (i.e., by mutagenesis of the corresponding gene) 
such that the overall ability of the cell to grow and divide or to remain viable and
productive is increased. The production of fine chemicals from C. glutamicum is
generally accomplished by the large-scale fermentative culture of these microorganisms,
conditions which are frequently suboptimal for growth and cell division. By 
engineering a protein of the invention (e.g., a stress response protein, a cell wall protein, 
or proteins involved in the metabolism of compounds necessary for cell growth and
division to occur, such as nucleotides and amino acids) such that it is better able to
survive, grow, and multiply in such conditions, it may be possible to increase the 
number and productivity of such engineered C. glutamicum cells in large-scale culture, 
which in turn should result in increased yields and/or efficiency of production of one or 
more desired fine chemicals. Further, the metabolic pathways of any cell are necessarily
interrelated and coregulated. By altering the activity or regulation of any one metabolic
pathway in C. glutamicum (i.e., by altering the activity of one of the proteins of the 
invention which participates in such a pathway), it is possible to concomitantly alter the 
activity or regulation of other metabolic pathways in this microorganism, which may be 
directly involved in the synthesis or degradation of a fine chemical. 

The invention provides novel nucleic acid molecules which encode proteins,
referred to herein as MCP proteins, which are capable of, for example, modulating the
production or efficiency of production of one or more fine chemicals from C.
glutamicum, or of serving as identifying markers for C. glutamicum or related
organisms. Nucleic acid molecules encoding an MCP protein are referred to herein as
MCP nucleic acid molecules. In a preferred embodiment, the MCP protein is capable of
modulating the production or efficiency of production of one or more fine chemicals
from C. glutamicum, or of serving as an identifying marker for C. glutamicum or related
organisms. Examples of such proteins include those encoded by the genes set forth in
Table 1.

Accordingly, one aspect of the invention pertains to isolated nucleic acid
molecules (e.g., cDNAs, DNAs, or RNAs) comprising a nucleotide sequence encoding
an MCP protein or biologically active portions thereof, as well as nucleic acid fragments
suitable as primers or hybridization probes for the detection or amplification of
MCP-encoding nucleic acid (e.g., DNA or mRNA). In particularly preferred embodiments,
the isolated nucleic acid molecule comprises one of the nucleotide sequences set forth in
Appendix A or the coding region or a complement thereof of one of these nucleotide
sequences. In other particularly preferred embodiments, the isolated nucleic acid
molecule of the invention comprises a nucleotide sequence which hybridizes to or is at
least about 50%, preferably at least about 60%, more preferably at least about 70%, 80%
or 90%, and even more preferably at least about 95%, 96%, 97%, 98%, 99% or more
homologous to a nucleotide sequence set forth in Appendix A, or a portion thereof. In
other preferred embodiments, the isolated nucleic acid molecule encodes one of the
amino acid sequences set forth in Appendix B. The preferred MCP proteins of the
present invention also preferably possess at least one of the MCP activities described
herein.

In another embodiment, the isolated nucleic acid molecule encodes a protein or 
portion thereof wherein the protein or portion thereof includes an amino acid sequence
which is sufficiently homologous to an amino acid sequence of Appendix B, e.g., 
sufficiently homologous to an amino acid sequence of Appendix B such that the protein 
or portion thereof maintains an MCP activity. Preferably, the protein or portion thereof 
encoded by the nucleic acid molecule maintains the ability to modulate the production or 
efficiency of production of one or more fine chemicals from C. glutamicum, or of
serving as an identifying marker for C. glutamicum or related organisms. In one 
embodiment, the protein encoded by the nucleic acid molecule is at least about 50%, 
preferably at least about 60%, and more preferably at least about 70%, 80%, or 90% and 
most preferably at least about 95%, 96%, 97%, 98%, or 99% or more homologous to an 
amino acid sequence of Appendix B (e.g., an entire amino acid sequence selected from
those sequences set forth in Appendix B). In another preferred embodiment, the protein 
is a full length C. glutamicum protein which is substantially homologous to an entire 
amino acid sequence of Appendix B (encoded by an open reading frame shown in 
Appendix A). 
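
The percentage thresholds recited above can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned to equal length with "-" marking gaps, and it uses one simple definition of percent identity (matching positions divided by aligned columns). The specification does not prescribe this calculation, and the function names and peptide fragments below are hypothetical.

```python
# Illustrative sketch under stated assumptions: sequences are pre-aligned,
# gaps are "-", and identity = matches / aligned columns. Names are invented.
THRESHOLDS = (50, 60, 70, 80, 90, 95, 96, 97, 98, 99)


def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences share a residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("no aligned positions to compare")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


def highest_threshold_met(identity):
    """Return the largest recited threshold that the identity reaches, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None


# Invented peptide fragments: 8 of 10 aligned columns match.
reference = "MKTAYIAKQR"
candidate = "MKTAYLAKQ-"
identity = percent_identity(reference, candidate)
print(identity, highest_threshold_met(identity))  # 80.0 80
```

The word "about" in the text leaves some latitude around each threshold, which the strict comparison here does not model.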

In another preferred embodiment, the isolated nucleic acid molecule is derived
from C. glutamicum and encodes a protein (e.g., an MCP fusion protein) which includes
a biologically active domain which is at least about 50% or more homologous to one of
the amino acid sequences of Appendix B and is able to modulate the yield, production,
and/or efficiency of production of one or more fine chemicals from C. glutamicum, to
degrade hydrocarbons, to oxidize terpenoids, to serve as a target for drug development,
or to serve as an identifying marker for C. glutamicum or related organisms, and which
also includes heterologous nucleic acid sequences encoding a heterologous polypeptide
or regulatory regions.

In another embodiment, the isolated nucleic acid molecule is at least 15 
nucleotides in length and hybridizes under stringent conditions to a nucleic acid 
molecule comprising a nucleotide sequence of Appendix A. Preferably, the isolated
nucleic acid molecule corresponds to a naturally-occurring nucleic acid molecule. More
preferably, the isolated nucleic acid encodes a naturally-occurring C. glutamicum MCP
protein, or a biologically active portion thereof. 

Another aspect of the invention pertains to vectors, e.g., recombinant expression 
vectors, containing the nucleic acid molecules of the invention, and host cells into which
such vectors have been introduced. In one embodiment, such a host cell is used to
produce an MCP protein by culturing the host cell in a suitable medium. The MCP 
protein can then be isolated from the medium or the host cell. 

Yet another aspect of the invention pertains to a genetically altered 
microorganism in which an MCP gene has been introduced or altered. In one
embodiment, the genome of the microorganism has been altered by introduction of a
nucleic acid molecule of the invention encoding wild-type or mutated MCP sequence as
a transgene. In another embodiment, an endogenous MCP gene within the genome of
the microorganism has been altered, e.g., functionally disrupted, by homologous
recombination with an altered MCP gene. In another embodiment, an endogenous or
introduced MCP gene in a microorganism has been altered by one or more point
mutations, deletions, or inversions, but still encodes a functional MCP protein. In still
another embodiment, one or more of the regulatory regions (e.g., a promoter, repressor,
or inducer) of an MCP gene in a microorganism has been altered (e.g., by deletion,
truncation, inversion, or point mutation) such that the expression of the MCP gene is
modulated. In a preferred embodiment, the microorganism belongs to the genus
Corynebacterium or Brevibacterium, with Corynebacterium glutamicum being 
particularly preferred. In a preferred embodiment, the microorganism is also utilized for 
the production of a desired compound, such as an amino acid, with lysine being 
particularly preferred. 

In another aspect, the invention provides a method of identifying the presence or
activity of Corynebacterium diphtheriae in a subject. This method includes detection of
one or more of the nucleic acid or amino acid sequences of the invention (e.g., the
sequences set forth in Appendix A or Appendix B) in a subject, thereby detecting the
presence or activity of Corynebacterium diphtheriae in the subject.

Still another aspect of the invention pertains to an isolated MCP protein or a
portion, e.g., a biologically active portion, thereof. In a preferred embodiment, the
isolated MCP protein or portion thereof is capable of modulating the production or
efficiency of production of one or more fine chemicals from C. glutamicum, or of
serving as an identifying marker for C. glutamicum or related organisms. In another
preferred embodiment, the isolated MCP protein or portion thereof is sufficiently
homologous to an amino acid sequence of Appendix B such that the protein or portion
thereof maintains the ability to, for example, modulate the production or efficiency of
production of one or more fine chemicals from C. glutamicum, or to serve as identifying
markers for C. glutamicum or related organisms.

The invention also provides an isolated preparation of an MCP protein. In 
preferred embodiments, the MCP protein comprises an amino acid sequence of
Appendix B. In another preferred embodiment, the invention pertains to an isolated full
length protein which is substantially homologous to an entire amino acid sequence of 
Appendix B (encoded by an open reading frame set forth in Appendix A). In yet 
another embodiment, the protein is at least about 50%, preferably at least about 60%, 
and more preferably at least about 70%, 80%, or 90%, and most preferably at least about
95%, 96%, 97%, 98%, or 99% or more homologous to an entire amino acid sequence of
Appendix B. In other embodiments, the isolated MCP protein comprises an amino acid 
sequence which is at least about 50% or more homologous to one of the amino acid 
sequences of Appendix B and is able to modulate the yield, production, and/or 
efficiency of production of one or more fine chemicals from C. glutamicum, to degrade
hydrocarbons, to oxidize terpenoids, to serve as a target for drug development, or to
serve as an identifying marker for C. glutamicum or related organisms. 

Alternatively, the isolated MCP protein can comprise an amino acid sequence 
which is encoded by a nucleotide sequence which hybridizes, e.g., hybridizes under 
stringent conditions, or is at least about 50%, preferably at least about 60%, more
preferably at least about 70%, 80%, or 90%, and even more preferably at least about
95%, 96%, 97%, 98%, or 99% or more homologous, to a nucleotide sequence of
Appendix B. It is also preferred that the preferred forms of MCP proteins also have one
or more of the MCP bioactivities described herein. 

The MCP polypeptide, or a biologically active portion thereof, can be
operatively linked to a non-MCP polypeptide to form a fusion protein. In preferred
embodiments, this fusion protein has an activity which differs from that of the MCP
protein alone. In other preferred embodiments, this fusion protein is capable of
modulating the yield, production and/or efficiency of production of one or more fine
chemicals from C. glutamicum, or of serving as an identifying marker for C. glutamicum
or related organisms. In particularly preferred embodiments, integration of this fusion
protein into a host cell modulates production of a desired compound from the cell.

In another aspect, the invention provides methods for screening molecules which
modulate the activity of an MCP protein, either by interacting with the protein itself or a
substrate or binding partner of the MCP protein, or by modulating the transcription or 
translation of an MCP nucleic acid molecule of the invention. 

Another aspect of the invention pertains to a method for producing a fine
chemical. This method involves the culturing of a cell containing a vector directing the
expression of an MCP nucleic acid molecule of the invention, such that a fine chemical
is produced. In a preferred embodiment, this method further includes the step of
obtaining a cell containing such a vector, in which a cell is transfected with a vector
directing the expression of an MCP nucleic acid. In another preferred embodiment, this
method further includes the step of recovering the fine chemical from the culture. In a
particularly preferred embodiment, the cell is from the genus Corynebacterium or 
Brevibacterium, or is selected from those strains set forth in Table 3. 

Another aspect of the invention pertains to methods for modulating production of 
a molecule from a microorganism. Such methods include contacting the cell with an
agent which modulates MCP protein activity or MCP nucleic acid expression such that a
cell associated activity is altered relative to this same activity in the absence of the
agent. In a preferred embodiment, the cell is modulated for one or more C. glutamicum
MCP protein activities, such that the yield, production, and/or efficiency of production
of a desired fine chemical by this microorganism is improved. The agent which
modulates MCP protein activity can be an agent which stimulates MCP protein activity
or MCP nucleic acid expression. Examples of agents which stimulate MCP protein
activity or MCP nucleic acid expression include small molecules, active MCP proteins,
and nucleic acids encoding MCP proteins that have been introduced into the cell.
Examples of agents which inhibit MCP activity or expression include small molecules
and antisense MCP nucleic acid molecules.

Another aspect of the invention pertains to methods for modulating yields, 
production, and/or efficiency of production of a desired compound from a cell, 
involving the introduction of a wild-type or mutant MCP gene into a cell, either 
maintained on a separate plasmid or integrated into the genome of the host cell. If
integrated into the genome, such integration can be random, or it can take place by
homologous recombination such that the native gene is replaced by the introduced copy,
causing the production of the desired compound from the cell to be modulated. In a
preferred embodiment, said yields are increased. In another preferred embodiment, said
chemical is a fine chemical. In a particularly preferred embodiment, said fine chemical 
is an amino acid. In especially preferred embodiments, said amino acid is L-lysine. 



Detailed Description of the Invention

The present invention provides MCP nucleic acid and protein molecules. These 
MCP nucleic acid molecules may be utilized in the identification of Corynebacterium 
glutamicum or related organisms, in the mapping of the C. glutamicum genome (or a 
genome of a closely related organism), or in the identification of microorganisms which
may be used to produce fine chemicals, e.g., by fermentation processes. The proteins
encoded by these nucleic acids may be utilized in the direct or indirect modulation of the
production or efficiency of production of one or more fine chemicals from C.
glutamicum, as identifying markers for C. glutamicum or related organisms, in the
oxidation of terpenoids or the degradation of hydrocarbons, or as targets for the
development of therapeutic pharmaceutical compounds. Aspects of the invention are
further explicated below. 

I. Fine Chemicals 

The term 'fine chemical' is art-recognized and includes molecules produced by
an organism which have applications in various industries, such as, but not limited to,
the pharmaceutical, agriculture, and cosmetics industries. Such compounds include
organic acids, such as tartaric acid, itaconic acid, and diaminopimelic acid, both
proteinogenic and non-proteinogenic amino acids, purine and pyrimidine bases,
nucleosides, and nucleotides (as described e.g. in Kuninaka, A. (1996) Nucleotides and
related compounds, p. 561-612, in Biotechnology vol. 6, Rehm et al., eds. VCH:
Weinheim, and references contained therein), lipids, both saturated and unsaturated fatty
acids (e.g., arachidonic acid), diols (e.g., propane diol, and butane diol), carbohydrates
(e.g., hyaluronic acid and trehalose), aromatic compounds (e.g., aromatic amines,
vanillin, and indigo), vitamins and cofactors (as described in Ullmann's Encyclopedia of
Industrial Chemistry, vol. A27, "Vitamins", p. 443-613 (1996) VCH: Weinheim and
references therein; and Ong, A.S., Niki, E. & Packer, L. (1995) "Nutrition, Lipids,
Health, and Disease" Proceedings of the UNESCO/Confederation of Scientific and
Technological Associations in Malaysia, and the Society for Free Radical Research -
Asia, held Sept. 1-3, 1994 at Penang, Malaysia, AOCS Press, (1995)), enzymes,
polyketides (Cane et al. (1998) Science 282: 63-68), and all other chemicals described in
0818805086 and references therein. The metabolism and uses of certain of these fine
chemicals are further explicated below.

A. Amino Acid Metabolism and Uses

Amino acids comprise the basic structural units of all proteins, and as such are
essential for normal cellular functioning in all organisms. The term "amino acid" is
art-recognized. The proteinogenic amino acids, of which there are 20 species, serve as
structural units for proteins, in which they are linked by peptide bonds, while the
nonproteinogenic amino acids (hundreds of which are known) are not normally found in
proteins (see Ullmann's Encyclopedia of Industrial Chemistry, vol. A2, p. 57-97 VCH:
Weinheim (1985)). Amino acids may be in the D- or L- optical configuration, though
L-amino acids are generally the only type found in naturally-occurring proteins.
Biosynthetic and degradative pathways of each of the 20 proteinogenic amino acids
have been well characterized in both prokaryotic and eukaryotic cells (see, for example,
Stryer, L. Biochemistry, 3rd edition, pages 578-590 (1988)). The 'essential' amino acids
(histidine, isoleucine, leucine, lysine, methionine, phenylalanine, threonine, tryptophan,
and valine), so named because they are generally a nutritional requirement due to the
complexity of their biosyntheses, are readily converted by simple biosynthetic pathways
to the remaining 11 'nonessential' amino acids (alanine, arginine, asparagine, aspartate,
cysteine, glutamate, glutamine, glycine, proline, serine, and tyrosine). Higher animals
do retain the ability to synthesize some of these amino acids, but the essential amino
acids must be supplied from the diet in order for normal protein synthesis to occur.

Aside from their function in protein biosynthesis, these amino acids are
interesting chemicals in their own right, and many have been found to have various
applications in the food, feed, chemical, cosmetics, agriculture, and pharmaceutical
industries. Lysine is an important amino acid in the nutrition not only of humans, but
also of monogastric animals such as poultry and swine. Glutamate is most commonly
used as a flavor additive (mono-sodium glutamate, MSG) and is widely used throughout
the food industry, as are aspartate, phenylalanine, glycine, and cysteine. Glycine,
L-methionine and tryptophan are all utilized in the pharmaceutical industry. Glutamine,
valine, leucine, isoleucine, histidine, arginine, proline, serine and alanine are of use in
both the pharmaceutical and cosmetics industries. Threonine, tryptophan, and
D/L-methionine are common feed additives (Leuchtenberger, W. (1996) Amino acids -
technical production and use, p. 466-502 in Rehm et al. (eds.) Biotechnology vol. 6,
chapter 14a, VCH: Weinheim). Additionally, these amino acids have been found to be
useful as precursors for the synthesis of synthetic amino acids and proteins, such as
N-acetylcysteine, S-carboxymethyl-L-cysteine, (S)-5-hydroxytryptophan, and others
described in Ullmann's Encyclopedia of Industrial Chemistry, vol. A2, p. 57-97, VCH:
Weinheim, 1985.

The biosynthesis of these natural amino acids in organisms capable of
producing them, such as bacteria, has been well characterized (for review of bacterial
amino acid biosynthesis and regulation thereof, see Umbarger, H.E. (1978) Ann. Rev.
Biochem. 47: 533-606). Glutamate is synthesized by the reductive amination of
α-ketoglutarate, an intermediate in the citric acid cycle. Glutamine, proline, and arginine
are each subsequently produced from glutamate. The biosynthesis of serine is a
three-step process beginning with 3-phosphoglycerate (an intermediate in glycolysis), and
resulting in this amino acid after oxidation, transamination, and hydrolysis steps. Both
cysteine and glycine are produced from serine; the former by the condensation of
homocysteine with serine, and the latter by the transferal of the side-chain β-carbon
atom to tetrahydrofolate, in a reaction catalyzed by serine transhydroxymethylase.
Phenylalanine and tyrosine are synthesized from the glycolytic and pentose phosphate
pathway precursors erythrose 4-phosphate and phosphoenolpyruvate in a 9-step
biosynthetic pathway that differs only at the final two steps after synthesis of prephenate.
Tryptophan is also produced from these two initial molecules, but its synthesis is an
11-step pathway. Tyrosine may also be synthesized from phenylalanine, in a reaction
catalyzed by phenylalanine hydroxylase. Alanine, valine, and leucine are all
biosynthetic products of pyruvate, the final product of glycolysis. Aspartate is formed
from oxaloacetate, an intermediate of the citric acid cycle. Asparagine, methionine,
threonine, and lysine are each produced by the conversion of aspartate. Isoleucine is
formed from threonine. A complex 9-step pathway results in the production of histidine
from 5-phosphoribosyl-1-pyrophosphate, an activated sugar.
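
The precursor relationships summarized in the preceding paragraph can be captured in a small lookup table. The Python sketch below is only a restatement of that paragraph as a data structure: the dictionary layout and the `lineage` helper are hypothetical conveniences, and the table records only the immediate precursors named in the text, not intermediate steps or enzymes.

```python
# Illustrative sketch: immediate precursors as stated in the text above.
# The table layout and the lineage() helper are invented for this example.
AROMATIC_PRECURSORS = ("erythrose 4-phosphate", "phosphoenolpyruvate")

PRECURSORS = {
    "glutamate": ("alpha-ketoglutarate",),
    "glutamine": ("glutamate",),
    "proline": ("glutamate",),
    "arginine": ("glutamate",),
    "serine": ("3-phosphoglycerate",),
    "cysteine": ("serine",),
    "glycine": ("serine",),
    "phenylalanine": AROMATIC_PRECURSORS,
    "tyrosine": AROMATIC_PRECURSORS,
    "tryptophan": AROMATIC_PRECURSORS,
    "alanine": ("pyruvate",),
    "valine": ("pyruvate",),
    "leucine": ("pyruvate",),
    "aspartate": ("oxaloacetate",),
    "asparagine": ("aspartate",),
    "methionine": ("aspartate",),
    "threonine": ("aspartate",),
    "lysine": ("aspartate",),
    "isoleucine": ("threonine",),
    "histidine": ("5-phosphoribosyl-1-pyrophosphate",),
}


def lineage(amino_acid):
    """Trace an amino acid back through the table to a central metabolite."""
    path = [(amino_acid,)]
    frontier = PRECURSORS.get(amino_acid)
    while frontier is not None:
        path.append(frontier)
        # Keep tracing only while the precursor is a single tabulated amino acid.
        frontier = PRECURSORS.get(frontier[0]) if len(frontier) == 1 else None
    return path


print(lineage("isoleucine"))
# [('isoleucine',), ('threonine',), ('aspartate',), ('oxaloacetate',)]
print(lineage("tryptophan"))
# [('tryptophan',), ('erythrose 4-phosphate', 'phosphoenolpyruvate')]
```

Tracing isoleucine back to oxaloacetate shows how a change in one pathway can propagate to amino acids several conversions downstream.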

Amino acids in excess of the protein synthesis needs of the cell cannot be stored,
and are instead degraded to provide intermediates for the major metabolic pathways of
the cell (for review see Stryer, L. Biochemistry 3rd ed. Ch. 21 "Amino Acid Degradation
and the Urea Cycle" p. 495-516 (1988)). Although the cell is able to convert unwanted
amino acids into useful metabolic intermediates, amino acid production is costly in
terms of energy, precursor molecules, and the enzymes necessary to synthesize them.
Thus it is not surprising that amino acid biosynthesis is regulated by feedback inhibition,
in which the presence of a particular amino acid serves to slow or entirely stop its own
production (for overview of feedback mechanisms in amino acid biosynthetic pathways,
see Stryer, L. Biochemistry, 3rd ed. Ch. 24: "Biosynthesis of Amino Acids and Heme" p.
575-600 (1988)). Thus, the output of any particular amino acid is limited by the amount
of that amino acid present in the cell.

B. Vitamin, Cofactor, and Nutraceutical Metabolism and Uses

Vitamins, cofactors, and nutraceuticals comprise another group of molecules
which the higher animals have lost the ability to synthesize and so must ingest, although
they are readily synthesized by other organisms such as bacteria. These molecules are
either bioactive substances themselves, or are precursors of biologically active
substances which may serve as electron carriers or intermediates in a variety of
metabolic pathways. Aside from their nutritive value, these compounds also have
significant industrial value as coloring agents, antioxidants, and catalysts or other
processing aids. (For an overview of the structure, activity, and industrial applications
of these compounds, see, for example, Ullmann's Encyclopedia of Industrial Chemistry,
"Vitamins" vol. A27, p. 443-613, VCH: Weinheim, 1996.) The term "vitamin" is
art-recognized, and includes nutrients which are required by an organism for normal
functioning, but which that organism cannot synthesize by itself. The group of vitamins
may encompass cofactors and nutraceutical compounds. The language "cofactor"
includes nonproteinaceous compounds required for a normal enzymatic activity to
occur. Such compounds may be organic or inorganic; the cofactor molecules of the
invention are preferably organic. The term "nutraceutical" includes dietary supplements
having health benefits in plants and animals, particularly humans. Examples of such
molecules are vitamins, antioxidants, and also certain lipids (e.g., polyunsaturated fatty
acids).

The biosynthesis of these molecules in organisms capable of producing them,
such as bacteria, has been largely characterized (Ullmann's Encyclopedia of Industrial
Chemistry, "Vitamins" vol. A27, p. 443-613, VCH: Weinheim, 1996; Michal, G. (1999)
Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, John Wiley
& Sons; Ong, A.S., Niki, E. & Packer, L. (1995) "Nutrition, Lipids, Health, and
Disease" Proceedings of the UNESCO/Confederation of Scientific and Technological
Associations in Malaysia, and the Society for Free Radical Research - Asia, held Sept.
1-3, 1994 at Penang, Malaysia, AOCS Press: Champaign, IL X, 374 S).

Thiamin (vitamin B1) is produced by the chemical coupling of pyrimidine and
thiazole moieties. Riboflavin (vitamin B2) is synthesized from guanosine-5'-triphosphate
(GTP) and ribose-5'-phosphate. Riboflavin, in turn, is utilized for the synthesis of flavin
mononucleotide (FMN) and flavin adenine dinucleotide (FAD). The family of
compounds collectively termed 'vitamin B6' (e.g., pyridoxine, pyridoxamine,
pyridoxal-5'-phosphate, and the commercially used pyridoxine hydrochloride) are all derivatives of
the common structural unit, 5-hydroxy-6-methylpyridine. Pantothenate (pantothenic
acid, (R)-(+)-N-(2,4-dihydroxy-3,3-dimethyl-1-oxobutyl)-β-alanine) can be produced
either by chemical synthesis or by fermentation. The final steps in pantothenate
biosynthesis consist of the ATP-driven condensation of β-alanine and pantoic acid. The
enzymes responsible for the biosynthesis steps for the conversion to pantoic acid, to
β-alanine and for the condensation to pantothenic acid are known. The metabolically
active form of pantothenate is Coenzyme A, for which the biosynthesis proceeds in 5
enzymatic steps. Pantothenate, pyridoxal-5'-phosphate, cysteine and ATP are the
precursors of Coenzyme A. These enzymes not only catalyze the formation of
pantothenate, but also the production of (R)-pantoic acid, (R)-pantolactone,
(R)-panthenol (provitamin B5), pantetheine (and its derivatives) and coenzyme A.

Biotin biosynthesis from the precursor molecule pimeloyl-CoA in 
microorganisms has been studied in detail and several of the genes involved have been 
identified. Many of the corresponding proteins have been found to also be involved in 
Fe-cluster synthesis and are members of the nifS class of proteins. Lipoic acid is 
derived from octanoic acid, and serves as a coenzyme in energy metabolism, where it 
becomes part of the pyruvate dehydrogenase complex and the α-ketoglutarate 
dehydrogenase complex. The folates are a group of substances which are all derivatives 
of folic acid, which in turn is derived from L-glutamic acid, p-amino-benzoic acid and 
6-methylpterin. The biosynthesis of folic acid and its derivatives, starting from the 
metabolism intermediates guanosine-5'-triphosphate (GTP), L-glutamic acid and 
p-amino-benzoic acid has been studied in detail in certain microorganisms. 

Corrinoids (such as the cobalamins and particularly vitamin B12) and 
porphyrins belong to a group of chemicals characterized by a tetrapyrrole ring system. 
The biosynthesis of vitamin B12 is sufficiently complex that it has not yet been 
completely characterized, but many of the enzymes and substrates involved are now 
known. Nicotinic acid (nicotinate), and nicotinamide are pyridine derivatives which are 
also termed 'niacin'. Niacin is the precursor of the important coenzymes NAD 
(nicotinamide adenine dinucleotide) and NADP (nicotinamide adenine dinucleotide 
phosphate) and their reduced forms. 

The large-scale production of these compounds has largely relied on cell-free 
chemical syntheses, though some of these chemicals, such as riboflavin, vitamin B6, 
pantothenate, and biotin, have also been produced by large-scale culture of 
microorganisms. Only vitamin B12 is produced solely by fermentation, due to the complexity of 
its synthesis. In vitro methodologies require significant inputs of materials and time, 
often at great cost. 

C. Purine, Pyrimidine, Nucleoside and Nucleotide Metabolism and Uses 

Purine and pyrimidine metabolism genes and their corresponding proteins are 
[...] "purine" or "pyrimidine" includes the nitrogenous bases which are constituents of 
nucleic acids, co-enzymes, and nucleotides. The term "nucleotide" includes the basic 
structural units of nucleic acid molecules, which are comprised of a nitrogenous base, a 
pentose sugar (in the case of RNA, the sugar is ribose; in the case of DNA, the sugar is 
D-deoxyribose), and phosphoric acid. The language "nucleoside" includes molecules 
which serve as precursors to nucleotides, but which are lacking the phosphoric acid 
moiety that nucleotides possess. By inhibiting the biosynthesis of these molecules, or 
their mobilization to form nucleic acid molecules, it is possible to inhibit RNA and DNA 
synthesis; by inhibiting this activity in a fashion targeted to cancerous cells, the ability 
of tumor cells to divide and replicate may be inhibited. Additionally, there are 
nucleotides which may serve as energy stores (e.g., ADP, ATP) or as coenzymes (i.e., 
FAD and NAD). 

Several publications have described the use of these chemicals for these medical 
indications, by influencing purine and/or pyrimidine metabolism (e.g., Christopherson, 
R.I. and Lyons, S.D. (1990) "Potent inhibitors of de novo pyrimidine and purine 
biosynthesis as chemotherapeutic agents." Med. Res. Reviews 10: 505-548). Studies of 
enzymes involved in purine and pyrimidine metabolism have been focused on the 
development of new drugs which can be used, for example, as immunosuppressants or 
anti-proliferants (Smith, J.L., (1995) "Enzymes in nucleotide synthesis." Curr. Opin. 
Struct. Biol. 5: 752-757; (1995) Biochem Soc. Transact. 23: 877-902). However, purine 
and pyrimidine bases, nucleosides and nucleotides have other utilities: as intermediates 
in the biosynthesis of several fine chemicals (e.g., thiamine, S-adenosyl-methionine, 
folates, or riboflavin), as energy carriers for the cell (e.g., ATP or GTP), and for 
chemicals themselves, commonly used as flavor enhancers (e.g., IMP or GMP) or for 
several medicinal applications (see, for example, Kuninaka, A. (1996) Nucleotides and 
Related Compounds in Biotechnology vol. 6, Rehm et al., eds. VCH: Weinheim, p. 
561-612). Also, enzymes involved in purine, pyrimidine, nucleoside, or nucleotide 
metabolism are increasingly serving as targets against which chemicals for crop 
protection, including fungicides, herbicides and insecticides, are developed. 

The metabolism of these compounds in bacteria has been characterized (for 
reviews see, for example, Zalkin, H. and Dixon, J.E. (1992) "de novo purine nucleotide 
biosynthesis", in: Progress in Nucleic Acid Research and Molecular Biology, vol. 42, 
Academic Press, p. 259-287; and Michal, G. (1999) "Nucleotides and Nucleosides", 
Chapter 8 in: Biochemical Pathways: An Atlas of Biochemistry and Molecular Biology, 
Wiley: New York). Purine metabolism has been the subject of intensive research, and is 
essential to the normal functioning of the cell. Impaired purine metabolism in higher 
[...] ribose-5-phosphate, in a series of steps through the intermediate compound 
inosine-5'-phosphate (IMP), resulting in the production of guanosine-5'-monophosphate 
(GMP) or adenosine-5'-monophosphate (AMP), from which the triphosphate forms utilized 
as nucleotides are readily formed. [...] 
their degradation provides energy for many different biochemical processes in the cell. 
Pyrimidine biosynthesis proceeds by the formation of uridine-5'-monophosphate (UMP) 
from ribose-5-phosphate. UMP, in turn, is converted to cytidine-5'-triphosphate (CTP). 
The deoxy- forms of all of these nucleotides are produced in a one step reduction 
reaction from the diphosphate ribose form of the nucleotide to the diphosphate 
deoxyribose form of the nucleotide. Upon phosphorylation, these molecules are able to 
participate in DNA synthesis. 

D. Trehalose Metabolism and Uses 

Trehalose consists of two glucose molecules, bound in an α,α-1,1 linkage. It is 
commonly used in the food industry as a sweetener, an additive for dried or frozen 
foods, and in beverages. However, it also has applications in the pharmaceutical, 
cosmetics and biotechnology industries (see, for example, Nishimoto et al., (1998) U.S. 
Patent No. 5,759,610; Singer, M.A. and Lindquist, S. (1998) Trends Biotech. 16: 
460-467; Paiva, C.L.A. and Panek, A.D. (1996) Biotech. Ann. Rev. 2: 293-314; and 
Shiosaka, M. (1997) J. Japan 172: 97-102). Trehalose is produced by enzymes from 
many microorganisms and is naturally released into the surrounding medium, from 
which it can be collected using methods known in the art. 

II. Elements and Methods of the Invention 

The present invention is based, at least in part, on the discovery of novel 
molecules, referred to herein as MCP nucleic acid molecules. These MCP nucleic acid 
molecules are useful not only for the identification of C. glutamicum or related bacterial 
species, but also as markers for the mapping of the C. glutamicum genome and in the 
identification of bacteria useful for the production of fine chemicals by, e.g., 
fermentative processes. The present invention is also based, at least in part, on the MCP 
protein molecules encoded by these MCP nucleic acid molecules. These MCP proteins 
are capable of modulating the yield, production, and/or efficiency of production of one 
or more fine chemicals from C. glutamicum, of serving as identifying markers for C. 
glutamicum or related organisms, of degrading hydrocarbons, and of serving as targets 
for the development of therapeutic pharmaceutical compounds. In one embodiment, the 
MCP molecules of the invention directly or indirectly participate in one or more fine 
chemical metabolic pathways in C. glutamicum. In a preferred embodiment, the activity 
of the MCP molecules of the invention to indirectly or directly participate in such 
metabolic pathways has an impact on the production of a desired fine chemical by this 
microorganism. In a particularly preferred embodiment, the MCP molecules of the 
invention are modulated in activity [...] 
which the MCP proteins of the invention participate are modulated in efficiency or 
output, which either directly or indirectly modulates the production or efficiency of 
production of a desired fine chemical by C. glutamicum. 

The language, "MCP protein" or "MCP polypeptide" includes proteins which are 
able to modulate the yield, production, and/or efficiency of production of one or more 
fine chemicals from C. glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to 
serve as a target protein for drug screening or design, or to serve as identifying markers 
for C. glutamicum or related organisms. Examples of MCP proteins include those 
encoded by the MCP genes set forth in Table 1 and Appendix A. The terms "MCP 
gene" or "MCP nucleic acid sequence" include nucleic acid sequences encoding an 
MCP protein, which consist of a coding region and also corresponding untranslated 5' 
and 3' sequence regions. Examples of MCP genes include those set forth in Table 1. The 
terms "production" or "productivity" are art-recognized and include the concentration of 
the fermentation product (for example, the desired fine chemical) formed within a given 
time and a given fermentation volume (e.g., kg product per hour per liter). The term 
"efficiency of production" includes the time required for a particular level of production 
to be achieved (for example, how long it takes for the cell to attain a particular rate of 
output of a fine chemical). The term "yield" or "product/carbon yield" is art-recognized 
and includes the efficiency of the conversion of the carbon source into the product (i.e., 
fine chemical). This is generally written as, for example, kg product per kg carbon 
source. By increasing the yield or production of the compound, the quantity of 
recovered molecules, or of useful recovered molecules of that compound in a given 
amount of culture over a given amount of time is increased. The terms "biosynthesis" or 
a "biosynthetic pathway" are art-recognized and include the synthesis of a compound, 
preferably an organic compound, by a cell from intermediate compounds in what may 
be a multistep and highly regulated process. The terms "degradation" or a "degradation 
pathway" are art-recognized and include the breakdown of a compound, preferably an 
organic compound, by a cell to degradation products (generally speaking, smaller or less 
complex molecules) in what may be a multistep and highly regulated process. The 
language "metabolism" is art-recognized and includes the totality of the biochemical 
reactions that take place in an organism. The metabolism of a particular compound, 
then, (e.g., the metabolism of an amino acid such as glycine) comprises the overall 
biosynthetic, modification, and degradation pathways in the cell related to this 
compound. 
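
By way of illustration only, the units given above for "production" (kg product per hour per liter) and "yield" (kg product per kg carbon source) may be computed as in the following minimal sketch. The function names and the fermentation figures are hypothetical and are not taken from this specification.

```python
def productivity(product_kg: float, hours: float, volume_l: float) -> float:
    """Product formed per unit time and fermentation volume (kg per hour per liter)."""
    if hours <= 0 or volume_l <= 0:
        raise ValueError("hours and volume_l must be positive")
    return product_kg / (hours * volume_l)


def carbon_yield(product_kg: float, carbon_source_kg: float) -> float:
    """Product/carbon yield (kg product per kg carbon source)."""
    if carbon_source_kg <= 0:
        raise ValueError("carbon_source_kg must be positive")
    return product_kg / carbon_source_kg


# Hypothetical run: 50 kg of product from 200 kg of carbon source,
# formed over 40 hours in a 1000 liter fermentation volume.
run_productivity = productivity(50.0, 40.0, 1000.0)
run_yield = carbon_yield(50.0, 200.0)
print(f"productivity: {run_productivity:.5f} kg/h/L")
print(f"yield: {run_yield:.2f} kg product per kg carbon source")
```

Keeping the two quantities separate reflects the definitions above: productivity depends on time and volume, whereas yield depends only on the amount of carbon source consumed.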

In another embodiment, the MCP molecules of the invention are capable of 
modulating the production of a desired molecule, such as a fine chemical, in a 
microorganism such as C. glutamicum, either directly or indirectly. Using recombinant 
genetic techniques, one or more of the MCP proteins of the invention may be 
manipulated such that its function is modulated. Such modulation of function may result 
in the modulation of the yield, production, and/or efficiency of production of one or 
more fine chemicals from C. glutamicum. 

For example, by modifying the activity of a protein involved in the biosynthesis 
or degradation of a fine chemical (i.e., through mutagenesis of the corresponding gene), 
one may directly modulate the ability of the cell to synthesize or to degrade this 
compound, thereby modulating the yield and/or efficiency of production of the fine 
chemical. Similarly, by modulating the activity of a protein which regulates a fine 
chemical metabolic pathway, one may directly influence whether the production of the 
desired compound is up- or down-regulated, either of which will modulate the yield or 
efficiency of production of the fine chemical from the cell. 

Indirect modulation of fine chemical production may also result by modifying 
the activity of a protein of the invention (i.e., by mutagenesis of the corresponding gene) 
such that the overall ability of the cell to grow and divide or to remain viable and 
productive is increased. The production of fine chemicals from C. glutamicum is 
generally accomplished by the large-scale fermentative culture of these microorganisms, 
conditions which are frequently suboptimal for growth and cell division. By 
engineering a protein of the invention (e.g., a stress response protein, a cell wall protein, 
or proteins involved in the metabolism of compounds necessary for cell growth and 
division to occur, such as nucleotides and amino acids) such that it is better able to 
survive, grow, and multiply in such conditions, it may be possible to increase the 
number and productivity of such engineered C. glutamicum cells in large-scale culture, 
which in turn should result in increased yields and/or efficiency of production of one or 
more desired fine chemicals. Further, the metabolic pathways of any cell are necessarily 
interrelated and coregulated. By altering the activity or regulation of any one metabolic 
pathway in C. glutamicum (i.e., by altering the activity of one of the proteins of the 
invention which participates in such a pathway), it is possible to concomitantly alter the 
activity or regulation of other metabolic pathways in this microorganism, which may be 
directly involved in the synthesis or degradation of a fine chemical. 

[...] Culture Collection, given designation ATCC 13032. The nucleotide sequences of the 
isolated C. glutamicum MCP nucleic acid molecules and the predicted amino acid 
sequences of the C. glutamicum MCP proteins are shown in Appendices A and B, 
respectively. Computational analyses [...] 
many of these nucleotide sequences as sequences having homology to E. coli or Bacillus 
subtilis genes. 

The present invention also pertains to proteins which have an amino acid 
sequence which is substantially homologous to an amino acid sequence of Appendix B. 
As used herein, a protein which has an amino acid sequence which is substantially 
homologous to a selected amino acid sequence is at least about 50% homologous to the 
selected amino acid sequence, e.g., the entire selected amino acid sequence. A protein 
which has an amino acid sequence which is substantially homologous to a selected 
amino acid sequence can also be at least about 50-60%, preferably at least about 60-70%, 
and more preferably at least about 70-80%, 80-90%, or 90-95%, and most preferably at 
least about 96%, 97%, 98%, 99% or more homologous to the selected amino acid 
sequence. 
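
As a non-limiting illustration of how a percentage of this kind might be evaluated, the following minimal sketch computes simple percent identity over two sequences that have already been aligned (gaps written as "-") and compares the result with the 50% lower bound recited above. The alignment itself, the function names, and the example sequences are assumptions made for illustration; this passage does not specify the comparison method, and percent identity is used here only as a stand-in for "homologous".

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of aligned columns with identical residues (gap-only columns skipped)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for res_a, res_b in zip(aligned_a, aligned_b):
        if res_a == "-" and res_b == "-":
            continue  # column carries no information
        compared += 1
        if res_a == res_b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


def is_substantially_homologous(identity_pct: float, threshold_pct: float = 50.0) -> bool:
    """True when the value meets the chosen lower bound (50% by default)."""
    return identity_pct >= threshold_pct


# Hypothetical aligned fragments, not sequences from Appendix B.
pid = percent_identity("MKT-LV", "MKSALV")
print(round(pid, 1), is_substantially_homologous(pid))
```

A column in which only one sequence has a gap is counted as a mismatch in this sketch; other conventions exist and would give different percentages.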

The MCP protein or a biologically active portion or fragment thereof of the 
invention is able to modulate the yield, production, and/or efficiency of production of 
one or more fine chemicals from C. glutamicum, to degrade hydrocarbons, to oxidize 
terpenoids, to serve as a target for drug development, or to serve as an identifying 
marker for C. glutamicum or related organisms. 

Various aspects of the invention are described in further detail in the following 
subsections: 

A. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated nucleic acid molecules that 
encode MCP polypeptides or biologically active portions thereof, as well as nucleic acid 
fragments sufficient for use as hybridization probes or primers for the identification or 
amplification of MCP-encoding nucleic acid (e.g., MCP DNA). These nucleic acid 
molecules may be used to identify C. glutamicum or related organisms, to map the 
genome of C. glutamicum or closely related bacteria, or to identify microorganisms 
useful for the production of fine chemicals, e.g., by fermentative processes. As used 
herein, the term "nucleic acid molecule" is intended to include DNA molecules (e.g., 
cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or 
RNA generated using nucleotide analogs. This term also encompasses untranslated 
sequence located at both the 3' and 5' ends of the coding region of the gene: at least 
about 100 nucleotides of sequence upstream from the 5' end of the coding region and at 
least about 20 nucleotides of sequence downstream from the 3' end of the coding region 
of the gene. The nucleic acid molecule can be single-stranded or double-stranded, but 
preferably is double-stranded DNA. An "isolated" nucleic acid molecule is one which is 
separated from other nucleic acid molecules [...] 
the nucleic acid. Preferably, an "isolated" nucleic acid is free of sequences which 
naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' ends of the 
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the isolated MCP nucleic acid molecule 
can contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide 
sequences which naturally flank the nucleic acid molecule in genomic DNA of the cell 
from which the nucleic acid is derived (e.g., a C. glutamicum cell). Moreover, an 
"isolated" nucleic acid molecule, such as a DNA molecule, can be substantially free of 
other cellular material, or culture medium when produced by recombinant techniques, or 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, e.g., a nucleic acid molecule 
having a nucleotide sequence of Appendix A, or a portion thereof, can be isolated using 
standard molecular biology techniques and the sequence information provided herein. 
For example, a C. glutamicum MCP DNA can be isolated from a C. glutamicum library 
using all or portion of one of the sequences of Appendix A as a hybridization probe and 
standard hybridization techniques (e.g., as described in Sambrook, J., Fritsch, E. F., and 
Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor 
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989). 
Moreover, a nucleic acid molecule encompassing all or a portion of one of the sequences 
of Appendix A can be isolated by the polymerase chain reaction using oligonucleotide 
primers designed based upon this sequence (e.g., a nucleic acid molecule encompassing 
all or a portion of one of the sequences of Appendix A can be isolated by the polymerase 
chain reaction using oligonucleotide primers designed based upon this same sequence of 
Appendix A). For example, mRNA can be isolated from normal endothelial cells (e.g., 
by the guanidinium-thiocyanate extraction procedure of Chirgwin et al. (1979) 
Biochemistry 18: 5294-5299) and DNA can be prepared using reverse transcriptase (e.g., 
Moloney MLV reverse transcriptase, available from Gibco/BRL, Bethesda, MD; or 
AMV reverse transcriptase, available from Seikagaku America, Inc., St. Petersburg, FL) 
and random polynucleotide primers or oligonucleotide primers based upon one of the 
nucleotide sequences shown in Appendix A. Synthetic oligonucleotide primers for 
polymerase chain reaction amplification can be designed based upon one of the 
nucleotide sequences shown in Appendix A. A nucleic acid of the invention can be 
amplified using cDNA or, alternatively, genomic DNA, as a template and appropriate 
oligonucleotide primers according to standard PCR amplification techniques. The 
nucleic acid so amplified can be cloned into an appropriate vector and characterized by 
DNA sequence analysis. Furthermore, oligonucleotides corresponding to an MCP 
nucleotide sequence [...] 
automated DNA synthesizer. 
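
The design of oligonucleotide primers from a known sequence, as referred to above, may be sketched as follows. This is a simplified illustration under stated assumptions: the forward primer is taken as the first bases of the template, the reverse primer as the reverse complement of the last bases, and considerations such as melting temperature or secondary structure are ignored. The function names and the template are hypothetical and do not correspond to any sequence of Appendix A.

```python
# Translation table mapping each DNA base to its complement.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def design_primers(template: str, length: int = 20) -> tuple[str, str]:
    """Return (forward, reverse) primers flanking the full template, both written 5' to 3'."""
    if length <= 0:
        raise ValueError("length must be positive")
    if len(template) < 2 * length:
        raise ValueError("template too short for two non-overlapping primers")
    forward = template[:length]
    reverse = reverse_complement(template[-length:])
    return forward, reverse


# Hypothetical 21-base template, used only to show the mechanics.
fwd, rev = design_primers("ATGGCTAAGCTTGACCGTTAA", length=6)
print(fwd, rev)
```

The reverse primer is reverse-complemented because it must anneal to the sense strand at the 3' end of the target region and be extended toward the forward primer.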

In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises one of the nucleotide sequences shown in Appendix A. The sequences of 
Appendix A correspond to the Corynebacterium glutamicum MCP DNAs of the 
invention. This cDNA comprises sequences encoding MCP proteins (i.e., the "coding 
region", indicated in each sequence in Appendix A), as well as 5' untranslated sequences 
and 3' untranslated sequences, also indicated in Appendix A. Alternatively, the nucleic 
acid molecule can comprise only the coding region of any of the sequences in Appendix 
A. 

For the purposes of this application, it will be understood that each of the 
sequences set forth in Appendix A has an identifying RXA or RXN number having the 
designation "RXA" or "RXN" followed by 5 digits (i.e., RXA00003 or RXN00022). 
Each of these sequences comprises up to three parts: a 5' upstream region, a coding 
region, and a downstream region. Each of these three regions is identified by the same 
RXA or RXN designation to eliminate confusion. The recitation "one of the sequences 
in Appendix A", then, refers to any of the sequences in Appendix A, which may be 
distinguished by their differing RXA or RXN designations. The coding region of each 
of these sequences is translated into a corresponding amino acid sequence, which is set 
forth in Appendix B. The sequences of Appendix B are identified by the same RXA or 
RXN designations as Appendix A, such that they can be readily correlated. For 
example, the amino acid sequence in Appendix B designated RXA00003 is a translation 
of the coding region of the nucleotide sequence of nucleic acid molecule RXA00003 in 
Appendix A, and the amino acid sequence in Appendix B designated RXN00022 is a 
translation of the coding region of the nucleotide sequence of nucleic acid molecule 
RXN00022 in Appendix A. Each of the RXA and RXN nucleotide and amino acid 
sequences of the invention has also been assigned a SEQ ID NO, as indicated in Table 1. 

Several of the genes of the invention are "F-designated genes". An F-designated 
gene includes those genes set forth in Table 1 which have an 'F' in front of the RXA 
designation. For example, SEQ ID NO:3, designated, as indicated on Table 1, as "F 
RXA01638", is an F-designated gene, as are SEQ ID NOs: 5, 9, and 11 (designated on 
Table 1 as "F RXA01639", "F RXA01590", and "F RXA01542", respectively). 
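
The designation scheme described above ("RXA" or "RXN" followed by 5 digits, optionally preceded by "F") lends itself to mechanical checking. The following minimal sketch parses such labels with a regular expression; the function name and the returned field names are hypothetical, and the pattern is an assumption inferred from the examples given in this section rather than a format defined in Table 1.

```python
import re

# Optional "F" prefix, then RXA or RXN, then exactly five digits.
_DESIGNATION = re.compile(r"^(?P<f>F\s+)?(?P<prefix>RX[AN])(?P<digits>\d{5})$")


def parse_designation(label: str) -> dict:
    """Split a label such as 'RXA00003' or 'F RXA01638' into its parts."""
    match = _DESIGNATION.match(label.strip())
    if match is None:
        raise ValueError(f"not an RXA/RXN designation: {label!r}")
    return {
        "prefix": match.group("prefix"),
        "number": int(match.group("digits")),
        "f_designated": match.group("f") is not None,
    }


for label in ("RXA00003", "RXN00022", "F RXA01638"):
    print(label, parse_designation(label))
```

Because Appendix A and Appendix B share the same designations, a parsed label of this kind could serve as the key for correlating a nucleotide sequence with its translation.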

In one embodiment, the nucleic acid molecules of the present invention are not 
[...] 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which is a complement of one of the 
nucleotide sequences shown in Appendix A, or a portion thereof. A nucleic acid 
molecule which is complementary to one of the nucleotide sequences shown in 
Appendix A is one which is sufficiently complementary to one of the nucleotide 
sequences shown in Appendix A such that it can hybridize to one of the nucleotide 
sequences shown in Appendix A, thereby forming a stable duplex. 

In still another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleotide sequence which is at least about 50%, 51%, 52%, 53%, 
54%, 55%, 56%, 57%, 58%, 59%, or 60%, preferably at least about 61%, 62%, 63%, 
64%, 65%, 66%, 67%, 68%, 69%, or 70%, more preferably at least about 71%, 72%, 
73%, 74%, 75%, 76%, 77%, 78%, 79%, or 80%, 81%, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, 89%, or 90%, or 91%, 92%, 93%, 94%, and even more preferably at least 
about 95%, 96%, 97%, 98%, 99% or more homologous to a nucleotide sequence shown 
in Appendix A, or a portion thereof. Ranges and identity values intermediate to the 
above-recited ranges (e.g., 70-90% identical or 80-95% identical) are also intended to 
be encompassed by the present invention. For example, ranges of identity values using a 
combination of any of the above values recited as upper and/or lower limits are intended 
to be included. In an additional preferred embodiment, an isolated nucleic acid 
molecule of the invention comprises a nucleotide sequence which hybridizes, e.g., 
hybridizes under stringent conditions, to one of the nucleotide sequences shown in 
Appendix A, or a portion thereof. 

Moreover, the nucleic acid molecule of the invention can comprise only a 
portion of the coding region of one of the sequences in Appendix A, for example a 
fragment which can be used as a probe or primer or a fragment encoding a biologically 
active portion of an MCP protein. The nucleotide sequences determined from the 
cloning of the MCP genes from C. glutamicum allow for the generation of probes and 
primers designed for use in identifying and/or cloning MCP homologues in other cell 
types and organisms, as well as MCP homologues from other Corynebacteria or related 
species. The probe/primer typically comprises substantially purified oligonucleotide. 
The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 12, preferably about 25, more preferably 
about 40, 50 or 75 consecutive nucleotides of a sense strand of one of the sequences set 
forth in Appendix A, an anti-sense sequence of one of the sequences set forth in 
Appendix A, or naturally occurring mutants thereof. Primers based on a nucleotide 
sequence of Appendix A can be used in PCR reactions to clone MCP homologues. 

[...] the MCP nucleotide sequences can be used to detect transcripts or 
genomic sequences encoding the same or homologous proteins. In preferred 
embodiments, the probe further comprises a label group attached thereto, e.g., the label 
group can be a radioisotope, a fluorescent compound, an enzyme, or an enzyme 
co-factor. Such probes can be used as a [...] 
which misexpress an MCP protein, such as by measuring a level of an MCP-encoding 
nucleic acid in a sample of cells, e.g., detecting MCP mRNA levels or determining 
whether a genomic MCP gene has been mutated or deleted. 

In one embodiment, the nucleic acid molecule of the invention encodes a protein 
or portion thereof which includes an amino acid sequence which is sufficiently 
homologous to an amino acid sequence of Appendix B such that the protein or portion 
thereof maintains the ability to modulate the yield, production, and/or efficiency of 
production of one or more fine chemicals from C. glutamicum, to degrade hydrocarbons, 
to oxidize terpenoids, to serve as a target for drug development, or to serve as an 
identifying marker for C. glutamicum or related organisms. As used herein, the 
language "sufficiently homologous" refers to proteins or portions thereof which have 
amino acid sequences which include a minimum number of identical or equivalent (e.g., 
an amino acid residue which has a similar side chain as an amino acid residue in one of 
the sequences of Appendix B) amino acid residues to an amino acid sequence of 
Appendix B such that the protein or portion thereof is able to modulate the yield, 
production, and/or efficiency of production of one or more fine chemicals from C. 
glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to serve as a target for drug 
development, or to serve as an identifying marker for C. glutamicum or related 
organisms. Examples of such activities are also described herein. Thus, "the function of 
an MCP protein" contributes to the overall regulation of one or more fine chemical 
metabolic pathways, or to the degradation of a hydrocarbon, or to the oxidation of a 
terpenoid. 

In another embodiment, the protein is at least about 50-60%, preferably at least 
about 60-70%, and more preferably at least about 70-80%, 80-90%, 90-95%, and most 
preferably at least about 96%, 97%, 98%, 99% or more homologous to an entire amino 
acid sequence of Appendix B. 

Portions of proteins encoded by the MCP nucleic acid molecules of the invention 
are preferably biologically active portions of one of the MCP proteins. As used herein, 
the term "biologically active portion of an MCP protein" is intended to include a portion, 
e.g., a domain/motif, of an MCP protein that modulates the yield, production, and/or 
efficiency of production of one or more fine chemicals from C. glutamicum, that 
degrades hydrocarbons, that oxidizes terpenoids, that may serve as a target for drug 
[...] organisms. To determine whether an MCP protein or a biologically active portion 
thereof can modulate the yield, production, and/or efficiency of production of one or 
more fine chemicals from C. glutamicum, can degrade hydrocarbons, or can oxidize 
terpenoids, an assay of activity may be performed [...] 
to those of ordinary skill in the art, as detailed in Example 8 of the Exemplification. 

Additional nucleic acid fragments encoding biologically active portions of an 
MCP protein can be prepared by isolating a portion of one of the sequences in Appendix 
B, expressing the encoded portion of the MCP protein or peptide (e.g., by recombinant 
expression in vitro) and assessing the activity of the encoded portion of the MCP protein 
or peptide. 

The invention further encompasses nucleic acid molecules that differ from one of 
the nucleotide sequences shown in Appendix A (and portions thereof) due to degeneracy 
of the genetic code and thus encode the same MCP protein as that encoded by the 
nucleotide sequences shown in Appendix A. In another embodiment, an isolated nucleic 
acid molecule of the invention has a nucleotide sequence encoding a protein having an 
amino acid sequence shown in Appendix B. In a still further embodiment, the nucleic 
acid molecule of the invention encodes a full length C. glutamicum protein which is 
substantially homologous to an amino acid sequence of Appendix B (encoded by an 
open reading frame shown in Appendix A). 
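
The effect of the degeneracy of the genetic code referred to above can be illustrated with a short sketch in which two different nucleotide sequences are translated to the same amino acid sequence. Only a small subset of the standard genetic code is included, and both sequences are hypothetical examples rather than sequences of Appendix A.

```python
# Subset of the standard genetic code, sufficient for the example below.
CODON_TABLE = {
    "ATG": "M",
    "AAA": "K", "AAG": "K",
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",
    "TAA": "*", "TAG": "*", "TGA": "*",
}


def translate(coding_region: str) -> str:
    """Translate a coding region codon by codon, stopping at the first stop codon."""
    if len(coding_region) % 3 != 0:
        raise ValueError("coding region length must be a multiple of 3")
    protein = []
    for i in range(0, len(coding_region), 3):
        codon = coding_region[i:i + 3].upper()
        if codon not in CODON_TABLE:
            raise ValueError(f"codon not in this reduced table: {codon}")
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon ends translation
        protein.append(residue)
    return "".join(protein)


# Two hypothetical sequences that differ at three nucleotide positions.
seq_1 = "ATGAAAGCTTAA"
seq_2 = "ATGAAGGCGTGA"
print(translate(seq_1), translate(seq_2), translate(seq_1) == translate(seq_2))
```

The two sequences differ in the third position of the lysine and alanine codons and in the stop codon, yet encode the same peptide, which is the situation contemplated by the preceding paragraph.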
20 It will be understood by one of ordinary skill in the art that in one embodiment 

the sequences of the invention are not meant to include the sequences of the prior art, 
such as those Genbank sequences set forth in Tables 2 or 4 which were available prior to 
the present invention. In one embodiment, the invention includes nucleotide and amino 
acid sequences having a percent identity to a nucleotide or amino acid sequence of the 
25 invention which is greater than that of a sequence of the prior art (e.g., a Genbank 
sequence (or the protein encoded by such a sequence) set forth in Tables 2 or 4). For 
example, the invention includes a nucleotide sequence which is greater than and/or at 
least 39% identical to the nucleotide sequence designated RXA00008 (SEQ ID 
NO: 1549), a nucleotide sequence which is greater than and/or at least 42% identical to 
30 the nucleotide sequence designated RXA00059 (SEQ ID NO:1571), and a nucleotide 
sequence which is greater than and/or at least 39% identical to the nucleotide sequence 
designated RXA00096 (SEQ ID NO:93). One of ordinary skill in the art would be able 
to calculate the lower threshold of percent identity for any given sequence of the 
invention by examining the GAP-calculated percent identity scores set forth in Table 4 
for each of the three top hits for the given sequence, and by subtracting the highest
GAP-calculated percent identity from 100 percent. One of ordinary skill in the art will
[...] greater than the lower threshold so calculated (e.g., at least 50%, 51%, 52%, 53%, 54%,
55%, 56%, 57%, 58%, 59%, or 60%, preferably at least about 61%, 62%, 63%, 64%,
65%, 66%, 67%, 68%, 69%, or 70%, more preferably at least about 71%, 72%, 73%,
[...] 88%, 89%, or 90%, or 91%, 92%, 93%, 94%, and even more preferably at least about
95%, 96%, 97%, 98%, 99% or more identical) are also encompassed by the invention.
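
The threshold calculation described in this paragraph can be sketched as follows. The
function name and the three identity scores are hypothetical placeholders; actual values
would be read from Table 4 for the sequence in question.

```python
def lower_identity_threshold(top_hit_identities):
    """Return 100 minus the highest GAP-calculated percent identity among the top hits."""
    if not top_hit_identities:
        raise ValueError("at least one percent identity score is required")
    return 100.0 - max(top_hit_identities)


# Hypothetical GAP percent identities for the three top hits of one sequence.
hits = [61.0, 58.0, 55.0]
print(lower_identity_threshold(hits))
```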

In addition to the C. glutamicum MCP nucleotide sequences shown in Appendix
A, it will be appreciated by those of ordinary skill in the art that DNA sequence
polymorphisms that lead to changes in the amino acid sequences of MCP proteins may
exist within a population (e.g., the C. glutamicum population). Such genetic
polymorphism in the MCP gene may exist among individuals within a population due to
natural variation. As used herein, the terms "gene" and "recombinant gene" refer to
nucleic acid molecules comprising an open reading frame encoding an MCP protein,
preferably a C. glutamicum MCP protein. Such natural variations can typically result in
1-5% variance in the nucleotide sequence of the MCP gene. Any and all such
nucleotide variations and resulting amino acid polymorphisms in MCP that are the result
of natural variation and that do not alter the functional activity of MCP proteins are
intended to be within the scope of the invention.

Nucleic acid molecules corresponding to natural variants and non-C. glutamicum
homologues of the C. glutamicum MCP DNA of the invention can be isolated based on
their homology to the C. glutamicum MCP nucleic acid disclosed herein using the C.
glutamicum DNA, or a portion thereof, as a hybridization probe according to standard
hybridization techniques under stringent hybridization conditions. Accordingly, in
another embodiment, an isolated nucleic acid molecule of the invention is at least 15
nucleotides in length and hybridizes under stringent conditions to the nucleic acid
molecule comprising a nucleotide sequence of Appendix A. In other embodiments, the
nucleic acid is at least 30, 50, 100, 250 or more nucleotides in length. As used herein,
the term "hybridizes under stringent conditions" is intended to describe conditions for
hybridization and washing under which nucleotide sequences at least 60% homologous
to each other typically remain hybridized to each other. Preferably, the conditions are
such that sequences at least about 65%, more preferably at least about 70%, and even
more preferably at least about 75% or more homologous to each other typically remain
hybridized to each other. Such stringent conditions are known to those of ordinary skill
in the art and can be found in Current Protocols in Molecular Biology, John Wiley &
Sons, N.Y. (1989), 6.3.1-6.3.6. A preferred, non-limiting example of stringent
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC)
[...]

Preferably, an isolated nucleic acid molecule of the invention that hybridizes under
stringent conditions to a sequence of Appendix A corresponds to a naturally-occurring
nucleic acid molecule. As used herein, a "naturally-occurring" nucleic acid molecule
refers to an RNA or DNA molecule having a nucleotide sequence that occurs in nature
(e.g., encodes a natural protein). In one embodiment, the nucleic acid encodes a natural
C. glutamicum MCP protein.

In addition to naturally-occurring variants of the MCP sequence that may exist in 
the population, one of ordinary skill in the art will further appreciate that changes can be 
introduced by mutation into a nucleotide sequence of Appendix A, thereby leading to 
changes in the amino acid sequence of the encoded MCP protein, without altering the 
functional ability of the MCP protein. For example, nucleotide substitutions leading to 
amino acid substitutions at "non-essential" amino acid residues can be made in a 
sequence of Appendix A. A "non-essential" amino acid residue is a residue that can be 
altered from the wild-type sequence of one of the MCP proteins (Appendix B) without 
altering the activity of said MCP protein, whereas an "essential" amino acid residue is 
required for MCP protein activity. Other amino acid residues, however, (e.g., those that 
are not conserved or only semi-conserved in the domain having MCP activity) may not 
be essential for activity and thus are likely to be amenable to alteration without altering 
MCP activity. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding MCP proteins that contain changes in amino acid residues that are not essential 
for MCP activity. Such MCP proteins differ in amino acid sequence from a sequence 
contained in Appendix B yet retain at least one of the MCP activities described herein. 
In one embodiment, the isolated nucleic acid molecule comprises a nucleotide sequence 
encoding a protein, wherein the protein comprises an amino acid sequence at least about 
50% homologous to an amino acid sequence of Appendix B and is able to modulate the 
yield, production, and/or efficiency of production of one or more fine chemicals from C. 
glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to serve as a target for drug 
development, or to serve as an identifying marker for C. glutamicum or related 
organisms. Preferably, the protein encoded by the nucleic acid molecule is at least about 
50-60% homologous to one of the sequences in Appendix B, more preferably at least 
about 60-70% homologous to one of the sequences in Appendix B, even more 
preferably at least about 70-80%, 80-90%, 90-95% homologous to one of the sequences 
in Appendix B, and most preferably at least about 96%, 97%, 98%, or 99% homologous 
to one of the sequences in Appendix B. 
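
A percent homology figure of the kind recited above can be computed from a pairwise
alignment. The sketch below assumes the two sequences have already been aligned, with
gaps written as "-", and counts identical residues over the aligned columns. The function
name, the example peptides, and the choice of denominator are illustrative assumptions;
an alignment program such as GAP may score differently.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences have a gap; they carry no information.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == gap and y == gap)]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for x, y in columns if x == y and x != gap)
    return 100.0 * matches / len(columns)


# Hypothetical aligned peptides: one gap and one substitution over six columns.
print(round(percent_identity("MKT-LV", "MKSALV"), 1))
```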

[...] one of [...] are aligned for optimal comparison purposes (e.g., gaps can be introduced in
the sequence of one [...] corresponding amino acid
position [...] homology between [...]

In addition to the nucleic acid molecules encoding MCP proteins described
above, another aspect of the invention pertains to isolated nucleic acid molecules which
are antisense thereto. An "antisense" nucleic acid comprises a nucleotide sequence
which is complementary to a sense nucleic acid encoding a protein, e.g.,
complementary to the coding strand of a double-stranded cDNA molecule or
complementary to an mRNA sequence. Accordingly, an antisense nucleic acid can
hydrogen bond to a sense nucleic acid. The antisense nucleic acid can be
complementary to an entire MCP coding strand, or to only a portion thereof. In one
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the
coding strand of a nucleotide sequence encoding an MCP protein. The term "coding
region" refers to the region of the nucleotide sequence comprising codons which are
translated into amino acid residues (e.g., the entire coding region of SEQ ID NO. 1
(RXN01638) comprises nucleotides 1 to 900). In another embodiment, the antisense
nucleic acid molecule is antisense to a "noncoding region" of the coding strand of a
nucleotide sequence encoding MCP. The term "noncoding region" refers to 5' and 3'
sequences which flank the coding region that are not translated into amino acids (i.e.,
also referred to as 5' and 3' untranslated regions).

Given the coding strand sequences encoding MCP disclosed herein (e.g., the
sequences set forth in Appendix A), antisense nucleic acids of the invention can be
designed according to the rules of Watson and Crick base pairing. The antisense nucleic
acid molecule can be complementary to the entire coding region of MCP mRNA, but
more preferably is an oligonucleotide which is antisense to only a portion of the coding
or noncoding region of MCP mRNA. For example, the antisense oligonucleotide can be
complementary to the region surrounding the translation start site of MCP mRNA. An
antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 40, 45 or
50 nucleotides in length. An antisense nucleic acid of the invention can be constructed
by chemical synthesis and enzymatic ligation reactions using procedures known in the
art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be
chemically synthesized using naturally occurring nucleotides or variously modified
nucleotides designed to increase the biological stability of the molecules or to increase
the physical stability of the duplex formed between the antisense and sense nucleic
acids, e.g., phosphorothioate derivatives and acridine substituted nucleotides can be
used. Examples of modified nucleotides which can be used to generate the antisense
nucleic acid include 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil,
hypoxanthine, xanthine, 4-acetylcytosine, 5-(carboxyhydroxylmethyl) uracil,
5-carboxymethylaminomethyl-2-thiouridine, 5-carboxymethylaminomethyluracil,
dihydrouracil, beta-D-galactosylqueosine, inosine, N6-isopentenyladenine,
1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine,
2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7-methylguanine,
5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil,
beta-D-mannosylqueosine, [...] isopentenyladenine, uracil-5-oxyacetic acid (v),
wybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil,
2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5-oxyacetic acid methylester,
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3-N-2-carboxypropyl)
uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the
antisense nucleic acid can be produced biologically using an expression vector into
which a nucleic acid has been subcloned in an antisense orientation (i.e., RNA
transcribed from the inserted nucleic acid will be of an antisense orientation to a target
nucleic acid of interest, described further in the following subsection).
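
Designing an antisense oligonucleotide by Watson and Crick base pairing amounts to taking
the reverse complement of the targeted stretch of the coding strand. The sketch below
does this for a window around a translation start codon. The sequence, the window sizes,
and the function names are hypothetical and serve only to illustrate the rule.

```python
# Watson-Crick complements for DNA (upper and lower case).
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return dna.translate(COMPLEMENT)[::-1]


def antisense_oligo(coding_strand: str, start_codon_index: int,
                    upstream: int = 10, downstream: int = 10) -> str:
    """Antisense oligonucleotide covering the region around the start codon."""
    begin = max(0, start_codon_index - upstream)
    end = min(len(coding_strand), start_codon_index + 3 + downstream)
    return reverse_complement(coding_strand[begin:end])


# Hypothetical coding strand with its ATG start codon at index 9.
coding = "GGAGGATCCATGGCTAAACGT"
print(antisense_oligo(coding, 9, upstream=3, downstream=3))
```

The oligonucleotide length follows from the window chosen, so the lengths listed above
(about 5 to 50 nucleotides) correspond to different upstream and downstream settings.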

The antisense nucleic acid molecules of the invention are typically administered 
to a cell or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding an MCP protein to thereby inhibit expression of the 
protein, e.g., by inhibiting transcription and/or translation. The hybridization can be by 
conventional nucleotide complementarity to form a stable duplex, or, for example, in the 
case of an antisense nucleic acid molecule which binds to DNA duplexes, through 
specific interactions in the major groove of the double helix. The antisense molecule can 
be modified such that it specifically binds to a receptor or an antigen expressed on a 
selected cell surface, e.g., by linking the antisense nucleic acid molecule to a peptide or 
an antibody which binds to a cell surface receptor or antigen. The antisense nucleic acid 
molecule can also be delivered to cells using the vectors described herein. To achieve 
sufficient intracellular concentrations of the antisense molecules, vector constructs in 
which the antisense nucleic acid molecule is placed under the control of a strong 
prokaryotic, viral, or eukaryotic promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention 
is an α-anomeric nucleic acid molecule. An α-anomeric nucleic acid molecule forms
specific double-stranded hybrids with complementary RNA in which, contrary to the
usual β-units, the strands run parallel to each other (Gaultier et al. (1987) Nucleic Acids
Res. 15:6625-6641). The antisense nucleic acid molecule can also comprise a
2'-O-methylribonucleotide (Inoue et al. (1987) Nucleic Acids Res. 15:6131-6148) or a
chimeric RNA-DNA analogue (Inoue et al. (1987) FEBS Lett. 215:327-330).

In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity which are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they
[...] (described in Haseloff and Gerlach (1988) Nature 334:585-591)) can be used to
catalytically cleave MCP mRNA transcripts to thereby inhibit translation of MCP
mRNA. A ribozyme having specificity for an MCP-encoding nucleic acid can be
designed based upon [...]
SEQ ID NO. 1 (RXN01368) in Appendix A). For example, a derivative of a
Tetrahymena L-19 IVS RNA can be constructed in which the nucleotide sequence of the
active site is complementary to the nucleotide sequence to be cleaved in an
MCP-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 and Cech et al. U.S.
Patent No. 5,116,742. Alternatively, MCP mRNA can be used to select a catalytic RNA
having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., Bartel,
D. and Szostak, J.W. (1993) Science 261:1411-1418.

Alternatively, MCP gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of an MCP nucleotide sequence (e.g.,
an MCP promoter and/or enhancers) to form triple helical structures that prevent
transcription of an MCP gene in target cells. See generally, Helene, C. (1991)
Anticancer Drug Des. 6(6):569-84; Helene, C. et al. (1992) Ann. N.Y. Acad. Sci.
660:27-36; and Maher, L.J. (1992) Bioessays 14(12):807-15.

B. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression
vectors, containing a nucleic acid encoding an MCP protein (or a portion thereof). As
used herein, the term "vector" refers to a nucleic acid molecule capable of transporting
another nucleic acid to which it has been linked. One type of vector is a "plasmid",
which refers to a circular double stranded DNA loop into which additional DNA
segments can be ligated. Another type of vector is a viral vector, wherein additional
DNA segments can be ligated into the viral genome. Certain vectors are capable of
autonomous replication in a host cell into which they are introduced (e.g., bacterial
vectors having a bacterial origin of replication and episomal mammalian vectors). Other
vectors (e.g., non-episomal mammalian vectors) are integrated into the genome of a host
cell upon introduction into the host cell, and thereby are replicated along with the host
genome. Moreover, certain vectors are capable of directing the expression of genes to
which they are operatively linked. Such vectors are referred to herein as "expression
vectors". In general, expression vectors of utility in recombinant DNA techniques are
often in the form of plasmids. In the present specification, "plasmid" and "vector" can
be used interchangeably as the plasmid is the most commonly used form of vector.
However, the invention is intended to include such other forms of expression vectors,
such as viral vectors (e.g., replication defective retroviruses, adenoviruses and
adeno-associated viruses), which serve equivalent functions.

The recombinant expression vectors of the invention comprise a nucleic acid of
the invention in a form suitable for expression of the nucleic acid in a host cell, which
means that the recombinant expression vectors include one or more regulatory
sequences, selected on the basis of the host cells to be used for expression, which is
operatively linked to the nucleic acid sequence to be expressed. Within a recombinant
expression vector, "operably linked" is intended to mean that the nucleotide sequence of
interest is linked to the regulatory sequence(s) in a manner which allows for expression
of the nucleotide sequence (e.g., in an in vitro transcription/translation system or in a
host cell when the vector is introduced into the host cell). The term "regulatory
sequence" is intended to include promoters, repressor binding sites, activator binding
sites, enhancer regions and other expression control elements (e.g., terminators, other
elements of mRNA secondary structure, or polyadenylation signals). Such regulatory
sequences are described, for example, in Goeddel; Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Regulatory
sequences include those which direct constitutive expression of a nucleotide sequence in
many types of host cell and those which direct expression of the nucleotide sequence
only in certain host cells. Preferred regulatory sequences are, for example, promoters
such as cos-, tac-, trp-, tet-, trp-tet-, lpp-, lac-, lpp-lac-, lacIq-, T7-, T5-, T3-, gal-, trc-,
ara-, SP6-, amy, SPO2, or λ-PL, which are used preferably in bacteria. Additional
regulatory sequences are, for example, promoters from yeasts and fungi, such as ADC1,
MFα, AC, P-60, CYC1, GAPDH, TEF, rp28, ADH, promoters from plants such as
CaMV/35S, SSU, OCS, lib4, usp, STLS1, B33, nos or ubiquitin- or phaseolin-
promoters. It is also possible to use artificial promoters. It will be appreciated by those
of ordinary skill in the art that the design of the expression vector can depend on such
factors as the choice of the host cell to be transformed, the level of expression of protein
desired, etc. The expression vectors of the invention can be introduced into host cells to
thereby produce proteins or peptides, including fusion proteins or peptides, encoded by
nucleic acids as described herein (e.g., MCP proteins, mutant forms of MCP proteins,
fusion proteins, etc.).

The recombinant expression vectors of the invention can be designed for
expression of MCP proteins in prokaryotic or eukaryotic cells. For example, MCP
genes can be expressed in bacterial cells such as C. glutamicum, insect cells (using
baculovirus expression vectors), yeast and other fungal cells (see Romanos, M.A. et al.
(1992) "Foreign gene expression in yeast: a review", Yeast 8: 423-488; van den Hondel,
C.A.M.J.J. et al. (1991) "Heterologous gene expression in filamentous fungi" in: More
Gene Manipulations in Fungi, J.W. Bennet & L.L. Lasure, eds., p. 396-428: Academic
Press: San Diego; and van den Hondel, C.A.M.J.J. & Punt, P.J. (1991) "Gene transfer
systems and vector development for filamentous fungi", in: Applied Molecular Genetics
of Fungi, Peberdy, J.F. [...]
algae and multicellular plant cells (see Schmidt, R. and Willmitzer, L. (1988) "High
efficiency Agrobacterium tumefaciens-mediated transformation of Arabidopsis
thaliana leaf and cotyledon explants" Plant Cell Rep.: 583-586), or mammalian cells.
Suitable host cells are discussed further in Goeddel, Gene Expression Technology:
Methods in Enzymology 185, Academic Press, San Diego, CA (1990). Alternatively, the
recombinant expression vector can be transcribed and translated in vitro, for example
using T7 promoter regulatory sequences and T7 polymerase.

Expression of proteins in prokaryotes is most often carried out with vectors 
containing constitutive or inducible promoters directing the expression of either fusion 
or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion
vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein
from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 

Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith,
D.B. and Johnson, K.S. (1988) Gene 67:31-40), pMAL (New England Biolabs, Beverly,
MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase
(GST), maltose E binding protein, or protein A, respectively, to the target recombinant
protein. In one embodiment, the coding sequence of the MCP protein is cloned into a
pGEX expression vector to create a vector encoding a fusion protein comprising, from
the N-terminus to the C-terminus, GST-thrombin cleavage site-X protein. The fusion
protein can be purified by affinity chromatography using glutathione-agarose resin.
Recombinant MCP protein unfused to GST can be recovered by cleavage of the fusion
protein with thrombin.
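
The tag, cleavage site, and target arrangement can be modelled at the amino acid level
as in the sketch below. It assumes the thrombin recognition sequence LVPRGS with cleavage
after the arginine, as commonly used in GST fusion vectors; the tag and target peptides
are short hypothetical placeholders, not real GST or MCP sequences.

```python
# Assumed thrombin recognition sequence; cleavage is taken to occur between R and G.
THROMBIN_SITE = "LVPRGS"
CUT_OFFSET = 4  # number of site residues that stay with the N-terminal fragment


def make_fusion(tag: str, target: str, site: str = THROMBIN_SITE) -> str:
    """Assemble tag, cleavage site, and target protein from N- to C-terminus."""
    return tag + site + target


def cleave(fusion: str, site: str = THROMBIN_SITE, cut_offset: int = CUT_OFFSET):
    """Split a fusion protein at the first cleavage site; return (tag part, released part)."""
    index = fusion.find(site)
    if index < 0:
        raise ValueError("no cleavage site found in fusion protein")
    cut = index + cut_offset
    return fusion[:cut], fusion[cut:]


# Hypothetical placeholder peptides standing in for the tag and the target protein.
fusion = make_fusion("MSPILGYW", "MAK")
print(cleave(fusion))
```

Under this assumption the released protein keeps the two residues that follow the cut
site at its N-terminus.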

Examples of suitable inducible non-fusion E. coli expression vectors include
pTrc (Amann et al., (1988) Gene 69:301-315), pLG338, pACYC184, pBR322, pUC18,
pUC19, pKC30, pRep4, pHS1, pHS2, pPLc236, pMBL24, pLG200, pUR290,
pIN-III113-B1, λgt11, pBdCl, and pET 11d (Studier et al., Gene Expression Technology:
[...] Pouwels et al., eds. (1985) Cloning Vectors. Elsevier: New York ISBN 0 444 904018).
Target gene expression from the pTrc vector relies on host RNA polymerase
transcription from a hybrid trp-lac fusion promoter. Target gene expression from the
pET 11d vector [...]
a coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is supplied by
host strains BL21(DE3) or HMS174(DE3) from a resident λ prophage harboring a T7
gn1 gene under the transcriptional control of the lacUV 5 promoter. For transformation
of other varieties of bacteria, appropriate vectors may be selected. For example, the
plasmids pIJ101, pIJ364, pIJ702 and pIJ361 are known to be useful in transforming
Streptomyces, while plasmids pUB110, pC194, or pBD214 are suited for transformation
of Bacillus species. Several plasmids of use in the transfer of genetic information into
Corynebacterium include pHM1519, pBL1, pSA77, or pAJ667 (Pouwels et al., eds.
(1985) Cloning Vectors. Elsevier: New York ISBN 0 444 904018).

One strategy to maximize recombinant protein expression is to express the
protein in a host bacterium with an impaired capacity to proteolytically cleave the
recombinant protein (Gottesman, S., Gene Expression Technology: Methods in
Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another
strategy is to alter the nucleic acid sequence of the nucleic acid to be inserted into an
expression vector so that the individual codons for each amino acid are those
preferentially utilized in the bacterium chosen for expression, such as C. glutamicum
(Wada et al. (1992) Nucleic Acids Res. 20:2111-2118). Such alteration of nucleic acid
sequences of the invention can be carried out by standard DNA synthesis techniques.
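
The codon substitution strategy can be sketched as a synonymous recoding step: each codon
is replaced by a preferred codon for the same amino acid, leaving the encoded protein
unchanged. The preference table below is a hypothetical placeholder and does not
represent measured C. glutamicum codon usage; the function and variable names are
likewise assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Hypothetical preferred codons for a few amino acids (illustration only).
HYPOTHETICAL_PREFERRED = {"A": "GCC", "K": "AAG", "L": "CTG"}


def recode(dna: str, preferred: dict) -> str:
    """Replace each codon with the preferred synonymous codon, if one is listed."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        recoded.append(new_codon)
    return "".join(recoded)


# Hypothetical input encoding Met-Ala-Lys-Leu.
print(recode("ATGGCTAAATTA", HYPOTHETICAL_PREFERRED))
```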

In another embodiment, the MCP protein expression vector is a yeast expression
vector. Examples of vectors for expression in yeast S. cerevisiae include pYepSec1
(Baldari, et al., (1987) EMBO J. 6:229-234), 2 μ, pAG-1, Yep6, Yep13, pEMBLYe23,
pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987)
Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA). Vectors and
methods for the construction of vectors appropriate for use in other fungi, such as the
filamentous fungi, include those detailed in: van den Hondel, C.A.M.J.J. & Punt, P.J.
(1991) "Gene transfer systems and vector development for filamentous fungi", in:
Applied Molecular Genetics of Fungi, J.F. Peberdy, et al., eds., p. 1-28, Cambridge
University Press: Cambridge, and Pouwels et al., eds. (1985) Cloning Vectors. Elsevier:
New York (ISBN 0 444 904018).

Alternatively, the MCP proteins of the invention can be expressed in insect cells
using baculovirus expression vectors. Baculovirus vectors available for expression of
[...] (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Luckow and Summers (1989)
Virology 170:31-39).

In another embodiment, the MCP proteins of the invention may be expressed in
unicellular plant cells (such as algae) [...]
spermatophytes, such as crop plants). Examples of plant expression vectors include
those detailed in: Becker, D., Kemper, E., Schell, J. and Masterson, R. (1992) "New
plant binary vectors with selectable markers located proximal to the left border", Plant
Mol. Biol. 20: 1195-1197; and Bevan, M.W. (1984) "Binary Agrobacterium vectors for
plant transformation", Nucl. Acid. Res. 12: 8711-8721, and include pLGV23, pGHlac+,
pBIN19, pAK2004, and pDH51 (Pouwels et al., eds. (1985) Cloning Vectors. Elsevier:
New York ISBN 0 444 904018).

In yet another embodiment, a nucleic acid of the invention is expressed in
mammalian cells using a mammalian expression vector. Examples of mammalian
expression vectors include pCDM8 (Seed, B. (1987) Nature 329:840) and pMT2PC
(Kaufman et al. (1987) EMBO J. 6:187-195). When used in mammalian cells, the
expression vector's control functions are often provided by viral regulatory elements.
For example, commonly used promoters are derived from polyoma, Adenovirus 2,
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both
prokaryotic and eukaryotic cells see chapters 16 and 17 of Sambrook, J., Fritsch, E. F.,
and Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring
Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY,
1989.

In another embodiment, the recombinant mammalian expression vector is
capable of directing expression of the nucleic acid preferentially in a particular cell type
(e.g., tissue-specific regulatory elements are used to express the nucleic acid).
Tissue-specific regulatory elements are known in the art. Non-limiting examples of suitable
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al.
(1987) Genes Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1988)
Adv. Immunol. 43:235-275), in particular promoters of T cell receptors (Winoto and
Baltimore (1989) EMBO J. 8:729-733) and immunoglobulins (Banerji et al. (1983) Cell
33:729-740; Queen and Baltimore (1983) Cell 33:741-748), neuron-specific promoters
(e.g., the neurofilament promoter; Byrne and Ruddle (1989) PNAS 86:5473-5477),
pancreas-specific promoters (Edlund et al. (1985) Science 230:912-916), and mammary
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and
European Application Publication No. 264,166). Developmentally-regulated promoters
[...] Science 249:374-379) and the α-fetoprotein promoter (Camper and Tilghman (1989)
Genes Dev. 3:537-546).

The invention further provides a recombinant expression vector comprising a
DNA molecule of the invention cloned into the expression vector in an antisense
orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in
a manner which allows for expression (by transcription of the DNA molecule) of an
RNA molecule which is antisense to MCP mRNA. Regulatory sequences operatively
linked to a nucleic acid cloned in the antisense orientation can be chosen which direct
the continuous expression of the antisense RNA molecule in a variety of cell types, for
instance viral promoters and/or enhancers, or regulatory sequences can be chosen which
direct constitutive, tissue specific or cell type specific expression of antisense RNA.
The antisense expression vector can be in the form of a recombinant plasmid, phagemid
or attenuated virus in which antisense nucleic acids are produced under the control of a
high efficiency regulatory region, the activity of which can be determined by the cell
type into which the vector is introduced. For a discussion of the regulation of gene
expression using antisense genes see Weintraub, H. et al. (1986) "Antisense RNA as a
molecular tool for genetic analysis", Reviews - Trends in Genetics, Vol. 1(1).

Another aspect of the invention pertains to host cells into which a recombinant
expression vector of the invention has been introduced. The terms "host cell" and
"recombinant host cell" are used interchangeably herein. It is understood that such
terms refer not only to the particular subject cell but to the progeny or potential progeny
of such a cell. Because certain modifications may occur in succeeding generations due
to either mutation or environmental influences, such progeny may not, in fact, be
identical to the parent cell, but are still included within the scope of the term as used
herein.

A host cell can be any prokaryotic or eukaryotic cell. For example, an MCP 
protein can be expressed in bacterial cells such as C. glutamicum, insect cells, yeast or 
mammalian cells (such as Chinese hamster ovary cells (CHO) or COS cells). Other 
suitable host cells are known to those of ordinary skill in the art. Microorganisms related
to Corynebacterium glutamicum which may be conveniently used as host cells for the
nucleic acid and protein molecules of the invention are set forth in Table 3. 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via
conventional transformation or transfection techniques. As used herein, the terms
"transformation", "transfection", "conjugation" and "transduction" are intended to refer
to a variety of art-recognized techniques for introducing foreign nucleic acid (e.g., linear
DNA or RNA (e.g., a linearized vector or a gene construct alone without a vector) or
[...] transposon or other DNA) into a host cell, including using natural competence,
chemical-mediated transfer, calcium phosphate or calcium chloride co-precipitation,
DEAE-dextran-mediated transfection, lipofection, or electroporation. Suitable methods for
transforming [...] Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989), and
other laboratory manuals.

For stable transfection of mammalian cells, it is known that, depending upon the
expression vector and transfection technique used, only a small fraction of cells may
integrate the foreign DNA into their genome. In order to identify and select these
integrants, a gene that encodes a selectable marker (e.g., resistance to antibiotics) is
generally introduced into the host cells along with the gene of interest. Preferred
selectable markers include those which confer resistance to drugs, such as G418,
hygromycin and methotrexate. Nucleic acid encoding a selectable marker can be
introduced into a host cell on the same vector as that encoding an MCP protein or can be
introduced on a separate vector. Cells stably transfected with the introduced nucleic
acid can be identified by, for example, drug selection (e.g., cells that have incorporated
the selectable marker gene will survive, while the other cells die).

To create a homologous recombinant microorganism, a vector is prepared which
contains at least a portion of an MCP gene into which a deletion, addition or substitution
has been introduced to thereby alter, e.g., functionally disrupt, the MCP gene.
Preferably, this MCP gene is a Corynebacterium glutamicum MCP gene, but it can be a
homologue from a related bacterium or even from a mammalian, yeast, or insect source.
In a preferred embodiment, the vector is designed such that, upon homologous
recombination, the endogenous MCP gene is functionally disrupted (i.e., no longer
encodes a functional protein; also referred to as a "knock out" vector). Alternatively,
the vector can be designed such that, upon homologous recombination, the endogenous
MCP gene is mutated or otherwise altered but still encodes functional protein (e.g., the
upstream regulatory region can be altered to thereby alter the expression of the
endogenous MCP protein). In the homologous recombination vector, the altered portion
of the MCP gene is flanked at its 5' and 3' ends by additional nucleic acid of the MCP
gene to allow for homologous recombination to occur between the exogenous MCP
gene carried by the vector and an endogenous MCP gene in a microorganism. The
additional flanking MCP nucleic acid is of sufficient length for successful homologous
recombination with the endogenous gene. Typically, less than one kilobase of flanking
DNA (both at the 5' and 3' ends) is included in the vector (see, e.g., Thomas, K.R., and
Capecchi, M.R. (1987) Cell 51:503 for a description of homologous recombination
vectors). The vector is introduced into a microorganism (e.g., by electroporation) and
cells in which the introduced MCP gene has homologously recombined with the
endogenous MCP gene are selected, using art-known techniques.
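
The layout of such a "knock out" insert can be illustrated in silico. The following is a minimal sketch only, assuming a simple deletion design; the function name `build_knockout_insert`, its parameters, and the default flank length of 500 bases are hypothetical choices for illustration and are not taken from this application.

```python
def build_knockout_insert(gene: str, del_start: int, del_end: int, flank: int = 500) -> str:
    """Return the insert for a deletion-type knock-out vector.

    The insert consists of up to `flank` bases of gene sequence 5' of the
    deleted region, joined directly to up to `flank` bases 3' of it, so that
    gene[del_start:del_end] is absent from the construct.
    """
    if not (0 <= del_start < del_end <= len(gene)):
        raise ValueError("deletion must lie inside the gene")
    left = gene[max(0, del_start - flank):del_start]   # 5' homology arm
    right = gene[del_end:del_end + flank]              # 3' homology arm
    if not left or not right:
        raise ValueError("deletion leaves no flanking homology on one side")
    return left + right


# Toy example: delete the five central C residues, keeping 4-base flanks.
toy_gene = "A" * 10 + "C" * 5 + "G" * 10
print(build_knockout_insert(toy_gene, 10, 15, flank=4))
```

Both arms are required in this sketch because recombination on either side of the altered region depends on homology with the endogenous gene.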

In another embodiment, recombinant microorganisms can be produced which
contain selected systems which allow for regulated expression of the introduced gene.
For example, inclusion of an MCP gene on a vector placing it under control of the lac
operon permits expression of the MCP gene in the presence of IPTG. Such regulatory
systems are well known in the art.

In another embodiment, an endogenous MCP gene in a host cell is disrupted
(e.g., by homologous recombination or other genetic means known in the art) such that
expression of its protein product does not occur. In another embodiment, an endogenous
or introduced MCP gene in a host cell has been altered by one or more point mutations,
deletions, or inversions, but still encodes a functional MCP protein. In still another
embodiment, one or more of the regulatory regions (e.g., a promoter, repressor, or
inducer) of an MCP gene in a microorganism has been altered (e.g., by deletion,
truncation, inversion, or point mutation) such that the expression of the MCP gene is
modulated. One of ordinary skill in the art will appreciate that host cells containing
more than one of the described MCP gene and protein modifications may be readily
produced using the methods of the invention, and are meant to be included in the present
invention.

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in
culture, can be used to produce (i.e., express) an MCP protein. Accordingly, the
invention further provides methods for producing MCP proteins using the host cells of
the invention. In one embodiment, the method comprises culturing the host cell of the
invention (into which a recombinant expression vector encoding an MCP protein has
been introduced, or into whose genome has been introduced a gene encoding a wild-type
or altered MCP protein) in a suitable medium until MCP protein is produced. In another
embodiment, the method further comprises isolating MCP proteins from the medium or
the host cell.



C. Isolated MCP Proteins

Another aspect of the invention pertains to isolated MCP proteins, and
biologically active portions thereof. An "isolated" or "purified" protein or biologically
active portion thereof is substantially free of cellular material when produced by
recombinant DNA techniques, or chemical precursors or other chemicals when
chemically synthesized. The language "substantially free of cellular material" includes
preparations of MCP protein in which the protein is separated from cellular components
of the cells in which it is naturally or recombinantly produced. In one embodiment, the
language "substantially free of cellular material" includes preparations of MCP protein
having less than about 30% (by dry weight) of non-MCP protein (also referred to herein
as a "contaminating protein"), more preferably less than about 20% of non-MCP protein,
still more preferably less than about 10% of non-MCP protein, and most preferably less
than about 5% non-MCP protein. When the MCP protein or biologically active portion
thereof is recombinantly produced, it is also preferably substantially free of culture
medium, i.e., culture medium represents less than about 20%, more preferably less than
about 10%, and most preferably less than about 5% of the volume of the protein
preparation. The language "substantially free of chemical precursors or other
chemicals" includes preparations of MCP protein in which the protein is separated from
chemical precursors or other chemicals which are involved in the synthesis of the
protein. In one embodiment, the language "substantially free of chemical precursors or
other chemicals" includes preparations of MCP protein having less than about 30% (by
dry weight) of chemical precursors or non-MCP chemicals, more preferably less than
about 20% chemical precursors or non-MCP chemicals, still more preferably less than
about 10% chemical precursors or non-MCP chemicals, and most preferably less than
about 5% chemical precursors or non-MCP chemicals. In preferred embodiments,
isolated proteins or biologically active portions thereof lack contaminating proteins from
the same organism from which the MCP protein is derived. Typically, such proteins are
produced by recombinant expression of, for example, a C. glutamicum MCP protein in a
microorganism such as C. glutamicum.
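
The dry-weight tiers recited above (less than about 30%, 20%, 10%, and 5% contaminating material) can be expressed as a simple lookup. This is an illustrative sketch only; the function name `purity_tier` and the tier labels are hypothetical, and the word "about" in the text is treated here as a strict numeric cutoff purely for simplicity.

```python
def purity_tier(contaminant_pct: float) -> str:
    """Map a contaminant percentage (by dry weight) to the tier it satisfies."""
    if contaminant_pct < 0 or contaminant_pct > 100:
        raise ValueError("percentage must be between 0 and 100")
    # Tightest tier first, so the most stringent satisfied tier is reported.
    tiers = [
        (5.0, "most preferred (<5%)"),
        (10.0, "still more preferred (<10%)"),
        (20.0, "more preferred (<20%)"),
        (30.0, "substantially free (<30%)"),
    ]
    for limit, label in tiers:
        if contaminant_pct < limit:
            return label
    return "not substantially free"


print(purity_tier(12.5))
```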

An isolated MCP protein or a portion thereof of the invention is able to modulate
the yield, production, and/or efficiency of production of one or more fine chemicals
from C. glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to serve as a target
for drug development, or to serve as an identifying marker for C. glutamicum or related
organisms. In preferred embodiments, the protein or portion thereof comprises an
amino acid sequence which is sufficiently homologous to an amino acid sequence of
Appendix B such that the protein or portion thereof maintains the ability to modulate the
yield, production, and/or efficiency of production of one or more fine chemicals from C.
glutamicum, to degrade hydrocarbons, to oxidize terpenoids, to serve as a target for drug
development, or to serve as an identifying marker for C. glutamicum or related
organisms. The portion of the protein is preferably a biologically active portion as
described herein. In another preferred embodiment, an MCP protein of the invention
has an amino acid sequence shown in Appendix B. In yet another preferred
embodiment, the MCP protein has an amino acid sequence which is encoded by a
nucleotide sequence which hybridizes, e.g., hybridizes under stringent conditions, to a
nucleotide sequence of Appendix A. In still another preferred embodiment, the MCP
protein has an amino acid sequence which is encoded by a nucleotide sequence that is at
least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, or 60%, preferably
at least about 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, or 70%, more
preferably at least about 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, or 80%,
81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, or 90%, or 91%, 92%, 93%, 94%,
and even more preferably at least about 95%, 96%, 97%, 98%, 99% or more homologous
to one of the nucleic acid sequences of Appendix A, or a portion thereof. Ranges and
identity values intermediate to the above-recited values (e.g., 70-90% identical or 80-95%
identical) are also intended to be encompassed by the present invention. For
example, ranges of identity values using a combination of any of the above values
recited as upper and/or lower limits are intended to be included. The preferred MCP
proteins of the present invention also preferably possess at least one of the MCP
activities described herein. For example, a preferred MCP protein of the present
invention includes an amino acid sequence encoded by a nucleotide sequence which
hybridizes, e.g., hybridizes under stringent conditions, to a nucleotide sequence of
Appendix A, and which is able to modulate the yield, production, and/or efficiency of
production of one or more fine chemicals from C. glutamicum, to degrade hydrocarbons,
to oxidize terpenoids, to serve as a target for drug development, or to serve as an
identifying marker for C. glutamicum or related organisms.
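
A percent identity figure of the kind recited above can be computed from a pairwise alignment. The sketch below is one common convention only, and it is an assumption of this example rather than a definition given in this application: the two sequences are taken to be already aligned to equal length, gaps are written as "-", and gapped columns are excluded from the denominator. The names `percent_identity` and `meets_threshold` are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped aligned columns in which the two residues match."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Keep only columns where neither sequence has a gap.
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b) if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def meets_threshold(aligned_a: str, aligned_b: str, minimum_pct: float) -> bool:
    """True if the aligned pair is at least `minimum_pct` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= minimum_pct


print(percent_identity("ATGC", "ATGA"))       # 3 of 4 columns match
print(meets_threshold("ATGC", "ATGA", 70.0))
```

Other conventions (for example, counting gapped columns as mismatches) give different values for the same alignment, so the convention should be stated whenever a threshold is applied.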

In other embodiments, the MCP protein is substantially homologous to an amino
acid sequence of Appendix B and retains the functional activity of the protein of one of
the sequences of Appendix B yet differs in amino acid sequence due to natural variation
or mutagenesis, as described in detail in subsection 1 above. Accordingly, in another
embodiment, the MCP protein is a protein which comprises an amino acid sequence
which is at least about 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, or
60%, preferably at least about 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, or
70%, more preferably at least about 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%,
or 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, or 90%, or 91%, 92%,
93%, 94%, and even more preferably at least about 95%, 96%, 97%, 98%, 99% or more
homologous to an entire amino acid sequence of Appendix B and which has at least one
of the MCP activities described herein. Ranges and identity values intermediate to the
above-recited values (e.g., 70-90% identical or 80-95% identical) are also intended to
be encompassed by the present invention. For example, ranges of identity values using a
combination of any of the above values recited as upper and/or lower limits are intended
to be included.

Biologically active portions of an MCP protein include peptides comprising
amino acid sequences derived from the amino acid sequence of an MCP protein, e.g., an
amino acid sequence shown in Appendix B or the amino acid sequence of a protein
homologous to an MCP protein, which include fewer amino acids than a full length
MCP protein or the full length protein which is homologous to an MCP protein, and
exhibit at least one activity of an MCP protein. Typically, biologically active portions
(peptides, e.g., peptides which are, for example, 5, 10, 15, 20, 30, 35, 36, 37, 38, 39, 40,
50, 100 or more amino acids in length) comprise a domain or motif with at least one
activity of an MCP protein. Moreover, other biologically active portions, in which other
regions of the protein are deleted, can be prepared by recombinant techniques and
evaluated for one or more of the activities described herein. Preferably, the biologically
active portions of an MCP protein include one or more selected domains/motifs or
portions thereof having biological activity.

MCP proteins are preferably produced by recombinant DNA techniques. For
example, a nucleic acid molecule encoding the protein is cloned into an expression
vector (as described above), the expression vector is introduced into a host cell (as
described above) and the MCP protein is expressed in the host cell. The MCP protein
can then be isolated from the cells by an appropriate purification scheme using standard
protein purification techniques. Alternative to recombinant expression, an MCP protein,
polypeptide, or peptide can be synthesized chemically using standard peptide synthesis
techniques. Moreover, native MCP protein can be isolated from cells (e.g., endothelial
cells, bacterial cells, fungal cells or other cells), for example using an anti-MCP
antibody, which can be produced by standard techniques utilizing an MCP protein or
fragment thereof of this invention.

The invention also provides MCP chimeric or fusion proteins. As used herein,
an MCP "chimeric protein" or "fusion protein" comprises an MCP polypeptide
operatively linked to a non-MCP polypeptide. An "MCP polypeptide" refers to a
polypeptide having an amino acid sequence corresponding to an MCP protein, whereas a
"non-MCP polypeptide" refers to a polypeptide having an amino acid sequence
corresponding to a protein which is not substantially homologous to the MCP protein,
e.g., a protein which is different from the MCP protein and which is derived from the
same or a different organism. Within the fusion protein, the term "operatively linked" is
intended to indicate that the MCP polypeptide and the non-MCP polypeptide are fused
in-frame to each other. The non-MCP polypeptide can be fused to the N-terminus or
C-terminus of the MCP polypeptide. For example, in one embodiment the fusion protein
is a GST-MCP fusion protein in which the MCP sequences are fused to the C-terminus
of the GST sequences. Such fusion proteins can facilitate the purification of
recombinant MCP proteins. In another embodiment, the fusion protein is an MCP
protein containing a heterologous signal sequence at its N-terminus. In certain host cells
(e.g., mammalian host cells, bacterial host cells, fungal host cells), expression and/or
secretion of an MCP protein can be increased through use of a heterologous signal
sequence.

Preferably, an MCP chimeric or fusion protein of the invention is produced by
standard recombinant DNA techniques. For example, DNA fragments coding for the
different polypeptide sequences are ligated together in-frame in accordance with
conventional techniques, for example by employing blunt-ended or stagger-ended
termini for ligation, restriction enzyme digestion to provide for appropriate termini,
filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to avoid
undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene
can be synthesized by conventional techniques including automated DNA synthesizers.
Alternatively, PCR amplification of gene fragments can be carried out using anchor
primers which give rise to complementary overhangs between two consecutive gene
fragments which can subsequently be annealed and reamplified to generate a chimeric
gene sequence (see, for example, Current Protocols in Molecular Biology, eds. Ausubel
et al., John Wiley & Sons: 1992). Moreover, many expression vectors are commercially
available that already encode a fusion moiety (e.g., a GST polypeptide). An
MCP-encoding nucleic acid can be cloned into such an expression vector such that the fusion
moiety is linked in-frame to the MCP protein.
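
The requirement that the fusion moiety be linked in-frame can be checked computationally before a construct is built. The following is a minimal sketch under stated assumptions: both inputs are complete open reading frames given 5' to 3' on the coding strand, the standard stop codons TAA, TAG and TGA apply, and no linker is used. The function name `fuse_in_frame` and the toy sequences are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(n_terminal_orf: str, c_terminal_orf: str) -> str:
    """Join two coding sequences so the second is read in the frame of the first.

    A terminal stop codon on the N-terminal partner is removed, both parts
    must be whole numbers of codons, and the fused sequence may not contain
    a stop codon anywhere before its final codon.
    """
    n_part = n_terminal_orf.upper()
    c_part = c_terminal_orf.upper()
    if n_part[-3:] in STOP_CODONS:
        n_part = n_part[:-3]  # drop the stop so translation reads through
    if len(n_part) % 3 != 0 or len(c_part) % 3 != 0:
        raise ValueError("each coding sequence must be a multiple of three bases")
    fused = n_part + c_part
    codons = [fused[i:i + 3] for i in range(0, len(fused), 3)]
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("fusion contains a premature stop codon")
    return fused


# Toy example: a two-codon "tag" ORF fused upstream of a two-codon "target" ORF.
print(fuse_in_frame("ATGGCCTAA", "ATGAAATAG"))
```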

Homologues of the MCP protein can be generated by mutagenesis, e.g., discrete
point mutation or truncation of the MCP protein. As used herein, the term "homologue"
refers to a variant form of the MCP protein which acts as an agonist or antagonist of the
activity of the MCP protein. An agonist of the MCP protein can retain substantially the
same, or a subset, of the biological activities of the MCP protein. An antagonist of the
MCP protein can inhibit one or more of the activities of the naturally occurring form of
the MCP protein, by, for example, competitively binding to a downstream or upstream
member of a biochemical pathway which includes the MCP protein.

In an alternative embodiment, homologues of the MCP protein can be identified
by screening combinatorial libraries of mutants, e.g., truncation mutants, of the MCP
protein for MCP protein agonist or antagonist activity. In one embodiment, a variegated
library of MCP variants is generated by combinatorial mutagenesis at the nucleic acid
level and is encoded by a variegated gene library. A variegated library of MCP variants
can be produced by, for example, enzymatically ligating a mixture of synthetic
oligonucleotides into gene sequences such that a degenerate set of potential MCP
sequences is expressible as individual polypeptides, or alternatively, as a set of larger
fusion proteins (e.g., for phage display) containing the set of MCP sequences therein.
There are a variety of methods which can be used to produce libraries of potential MCP
homologues from a degenerate oligonucleotide sequence. Chemical synthesis of a
degenerate gene sequence can be performed in an automatic DNA synthesizer, and the
synthetic gene then ligated into an appropriate expression vector. Use of a degenerate
set of genes allows for the provision, in one mixture, of all of the sequences encoding
the desired set of potential MCP sequences. Methods for synthesizing degenerate
oligonucleotides are known in the art (see, e.g., Narang, S.A. (1983) Tetrahedron 39:3;
Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science
198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477).
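
What a degenerate oligonucleotide represents can be made concrete by enumerating the individual sequences it encodes. The sketch below assumes the standard IUPAC nucleotide ambiguity codes; the function name `expand_degenerate` is a hypothetical helper introduced for illustration, not part of any method described in this application.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(oligo: str) -> list[str]:
    """List every concrete sequence encoded by a degenerate oligonucleotide."""
    try:
        choices = [IUPAC[base] for base in oligo.upper()]
    except KeyError as err:
        raise ValueError(f"not an IUPAC nucleotide code: {err.args[0]}") from None
    return ["".join(combo) for combo in product(*choices)]


# GCN covers the four alanine codons; each added degenerate position
# multiplies the size of the encoded library.
print(expand_degenerate("GCN"))
print(len(expand_degenerate("GCNRAY")))
```

Because library size grows multiplicatively with each degenerate position, a count of this kind is a quick way to judge whether a given design can be screened exhaustively.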

In addition, libraries of fragments of the MCP protein coding sequence can be used to
generate a variegated population of MCP fragments for screening and subsequent
selection of homologues of an MCP protein. In one embodiment, a library of coding
sequence fragments can be generated by treating a double stranded PCR fragment of an
MCP coding sequence with a nuclease under conditions wherein nicking occurs only
about once per molecule, denaturing the double stranded DNA, renaturing the DNA to
form double stranded DNA which can include sense/antisense pairs from different
nicked products, removing single stranded portions from reformed duplexes by
treatment with S1 nuclease, and ligating the resulting fragment library into an expression
vector. By this method, an expression library can be derived which encodes N-terminal,
C-terminal and internal fragments of various sizes of the MCP protein.

Several techniques are known in the art for screening gene products of
combinatorial libraries made by point mutations or truncation, and for screening cDNA
libraries for gene products having a selected property. Such techniques are adaptable for
rapid screening of the gene libraries generated by the combinatorial mutagenesis of
MCP homologues. The most widely used techniques, which are amenable to
high-throughput analysis, for screening large gene libraries typically include cloning the
gene library into replicable expression vectors, transforming appropriate cells with the
resulting library of vectors, and expressing the combinatorial genes under conditions in
which detection of a desired activity facilitates isolation of the vector encoding the gene
whose product was detected. Recursive ensemble mutagenesis (REM), a new technique
which enhances the frequency of functional mutants in the libraries, can be used in
combination with the screening assays to identify MCP homologues (Arkin and
Youvan (1992) PNAS 89:7811-7815; Delagrave et al. (1993) Protein Engineering
6(3):327-331).

D. Uses and Methods of the Invention

The nucleic acid molecules, proteins, protein homologues, fusion
proteins, primers, vectors, and host cells described herein can be used in one or more of
the following methods: identification of C. glutamicum and related organisms; mapping
of genomes of organisms related to C. glutamicum; identification and localization of C.
glutamicum sequences of interest; evolutionary studies; determination of MCP protein
regions required for function; modulation of an MCP protein activity; modulation of the
activity of one or more metabolic pathways; and modulation of cellular production of a
desired compound, such as a fine chemical.

The MCP nucleic acid molecules of the invention have a variety of uses. First,
they may be used to identify an organism as being Corynebacterium glutamicum or a
close relative thereof. Also, they may be used to identify the presence of C. glutamicum
or a relative thereof in a mixed population of microorganisms. The invention provides
the nucleic acid sequences of a number of C. glutamicum genes, and probes based
thereon; by probing the extracted genomic DNA of a culture of a unique or mixed
population of microorganisms under stringent conditions with a probe spanning a region
of a C. glutamicum gene which is unique to this organism, one can ascertain whether
this organism is present. Although Corynebacterium glutamicum itself is
nonpathogenic, it is related to pathogenic species, such as Corynebacterium diphtheriae.
Corynebacterium diphtheriae is the causative agent of diphtheria, a rapidly developing,
acute, febrile infection which involves both local and systemic pathology. In this
disease, a local lesion develops in the upper respiratory tract and involves necrotic injury
to epithelial cells; the bacilli secrete toxin which is disseminated through this lesion to
distal susceptible tissues of the body. Degenerative changes brought about by the
inhibition of protein synthesis in these tissues, which include heart, muscle, peripheral
nerves, adrenals, kidneys, liver and spleen, result in the systemic pathology of the
disease. Diphtheria continues to have high incidence in many parts of the world,
including Africa, Asia, Eastern Europe and the independent states of the former Soviet
Union. An ongoing epidemic of diphtheria in the latter two regions has resulted in at
least 5,000 deaths since 1990.

In one embodiment, the invention provides a method of identifying the presence
or activity of Corynebacterium diphtheriae in a subject. This method includes detection
of one or more of the nucleic acid or amino acid sequences of the invention (e.g., the
sequences set forth in Appendix A or Appendix B) in a subject, thereby detecting the
presence or activity of Corynebacterium diphtheriae in the subject. C. glutamicum and
C. diphtheriae are related bacteria, and many of the nucleic acid and protein molecules
in C. glutamicum are homologous to C. diphtheriae nucleic acid and protein molecules,
and can therefore be used to detect C. diphtheriae in a subject.

To detect the presence of C. glutamicum in a sample, techniques well known in
the art may be employed. Specifically, the cells in the sample may optionally first be
cultured in a suitable liquid or on a suitable solid culture medium to increase the number
of cells in the sample. These cells are lysed, and the total DNA content extracted and
optionally purified to remove debris and protein material which may interfere with
subsequent analysis. The polymerase chain reaction or a similar technique known in the
art is performed (for general reference on methodologies commonly used for the
amplification of nucleic acid sequences, see Mullis et al., U.S. Patent No. 4,683,195,
Mullis et al., U.S. Patent No. 4,965,188, and Innis, M.A., and Gelfand, D.H., (1989)
PCR Protocols, A Guide to Methods and Applications, Academic Press, p. 3-12, and
(1988) Biotechnology 6:1197, and International Patent Application No. WO89/01050)
in which primers specific to an MCP nucleic acid molecule of the invention are
incubated with the nucleic acid sample such that, if present in the sample, that particular
MCP nucleic acid sequence will be amplified. The particular MCP nucleic acid to be
amplified is selected based on its uniqueness to the C. glutamicum genome, or to the
genomes of C. glutamicum and only a few closely related bacteria. The presence of the
desired amplified product is thus indicative of the presence of C. glutamicum, or an
organism closely related to C. glutamicum.
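
The logic of primer-specific amplification can be mimicked in silico to check whether a primer pair would yield a product from a given sequence. The sketch below is a simplification and rests on several assumptions: exact primer matches only, a single linear template given as its top strand, and no modelling of annealing temperature or mismatches. The function names `reverse_complement` and `predict_amplicon` and the toy sequences are hypothetical.

```python
from typing import Optional

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def predict_amplicon(template: str, forward: str, reverse: str) -> Optional[str]:
    """Return the predicted product on the top strand, or None if no product.

    The forward primer must match the top strand exactly; the reverse primer
    must match the bottom strand, i.e. its reverse complement must occur on
    the top strand downstream of the forward primer site.
    """
    top = template.upper()
    start = top.find(forward.upper())
    if start < 0:
        return None
    reverse_site = reverse_complement(reverse)
    end = top.find(reverse_site, start + len(forward))
    if end < 0:
        return None
    return top[start:end + len(reverse_site)]


toy_template = "GGATGCAAACCCTTTGACGG"
print(predict_amplicon(toy_template, "ATGCA", "GTCAA"))
print(predict_amplicon("AAAAAAAA", "ATGCA", "GTCAA"))
```

A return value of None corresponds to the case described above in which the target sequence is absent from the sample and no amplified product is expected.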

Further, the nucleic acid and protein molecules of the invention may serve as
markers for specific regions of the genome. It is possible, using techniques well known
in the art, to ascertain the physical location on the C. glutamicum genome of the MCP
nucleic acid molecules of the invention, which in turn provides markers on the genome
which can be used to aid in the placement of other nucleic acid molecules and genes on
the genome map. Also, the nucleic acid molecules of the invention may be sufficiently
homologous to the sequences of related bacterial species that these nucleic acid
molecules may similarly permit the construction of a genomic map in such bacteria
(e.g., Brevibacterium lactofermentum).

The nucleic acid molecules of the invention have utility not only in the mapping
of the genome, but also for functional studies of C. glutamicum proteins. For example,
to identify the region of the genome to which a particular C. glutamicum DNA-binding
protein binds, the C. glutamicum genome could be digested, and the fragments incubated
with the DNA-binding protein. Those which bind the protein may be additionally probed
with the nucleic acid molecules of the invention, preferably with readily detectable
labels; binding of such a nucleic acid molecule to the genome fragment enables the
localization of the fragment to the genome map of C. glutamicum, and, when performed
multiple times with different enzymes, facilitates a rapid determination of the nucleic
acid sequence to which the protein binds.

The MCP nucleic acid molecules of the invention are also useful for
evolutionary and protein structural studies. The metabolic processes in which the
molecules of the invention participate are utilized by a wide variety of prokaryotic and
eukaryotic cells; by comparing the sequences of the nucleic acid molecules of the
present invention to those encoding similar enzymes from other organisms, the
evolutionary relatedness of the organisms can be assessed. Similarly, such a comparison
permits an assessment of which regions of the sequence are conserved and which are
not, which may aid in determining those regions of the protein which are essential for
the functioning of the enzyme. This type of determination is of value for protein
engineering studies and may give an indication of what the protein can tolerate in terms
of mutagenesis without losing function.
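
The assessment of which regions are conserved can be sketched as a per-column score over a multiple sequence alignment. This is an illustrative example only: it assumes the sequences have already been aligned to equal length with "-" for gaps, and it scores each column as the fraction of sequences sharing the most common residue. The function names `column_conservation` and `conserved_positions` and the toy alignment are hypothetical.

```python
from collections import Counter


def column_conservation(aligned: list[str]) -> list[float]:
    """Fraction of sequences sharing the most common residue in each column."""
    if not aligned or len({len(seq) for seq in aligned}) != 1:
        raise ValueError("need at least one sequence, all of equal aligned length")
    scores = []
    for column in zip(*aligned):
        counts = Counter(residue for residue in column if residue != "-")
        # A column consisting only of gaps scores zero.
        top = counts.most_common(1)[0][1] if counts else 0
        scores.append(top / len(column))
    return scores


def conserved_positions(aligned: list[str], min_fraction: float = 1.0) -> list[int]:
    """Zero-based column indices whose conservation is at least `min_fraction`."""
    return [i for i, score in enumerate(column_conservation(aligned))
            if score >= min_fraction]


toy_alignment = ["MKTA", "MKSA", "MRTA"]
print(conserved_positions(toy_alignment))
```

Columns that are invariant across distantly related organisms are natural first candidates for regions essential to function, although conservation alone does not establish this.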

The MCP protein molecules of the invention may also be utilized as markers for
the classification of an unknown bacterium as C. glutamicum, or for the identification of
C. glutamicum or closely related bacteria in a sample. For example, using techniques
well known in the art, cells in a sample may optionally be amplified (e.g., by culturing
in an appropriate medium) to increase the sample size, and then may be lysed to release
proteins contained therein. This sample may optionally be purified to remove debris and
nucleic acid molecules which may interfere with subsequent analysis. Antibodies
specific for a selected MCP protein of the invention may be incubated with the protein
sample in a typical Western assay format (see, e.g., Ausubel et al. (1988) Current
Protocols in Molecular Biology, Wiley: New York) in which the antibody will bind to
its target protein if this protein is present in the sample. An MCP protein is selected for
this type of assay if it is unique or nearly unique to C. glutamicum or C. glutamicum and
bacteria very closely related to C. glutamicum. Proteins in the sample are then separated
by gel electrophoresis, and transferred to a suitable matrix, such as nitrocellulose. An
appropriate secondary antibody having a detectable label (e.g., chemiluminescent or
colorimetric) is incubated with this matrix, followed by stringent washing. The presence
or absence of the label is indicative of the presence or absence of the target protein in the
sample. If the protein is present, then this is indicative of the presence of C.
glutamicum. A similar process enables the classification of an unknown bacterium as C.
glutamicum; if a panel of proteins specific to C. glutamicum are not detected in protein
samples prepared from the unknown bacterium, then that bacterium is not likely to be C.
glutamicum.

The invention provides methods for screening molecules which modulate the
activity of an MCP protein, either by interacting with the protein itself or a substrate or
binding partner of the MCP protein, or by modulating the transcription or translation of
an MCP nucleic acid molecule of the invention. In such methods, a microorganism
expressing one or more MCP proteins of the invention is contacted with one or more test
compounds, and the effect of each test compound on the activity or level of expression
of the MCP protein is assessed.

Genetic manipulation of the MCP nucleic acid molecules of the invention may
result in the production of MCP proteins having functional differences from the
wild-type MCP proteins. These proteins may be improved in efficiency or activity, may be
present in greater numbers in the cell than is usual, or may be decreased in efficiency or
activity.

Such changes in activity may directly modulate the yield, production, and/or
efficiency of production of one or more fine chemicals from C. glutamicum. For
example, by modifying the activity of a protein involved in the biosynthesis or
degradation of a fine chemical (i.e., through mutagenesis of the corresponding gene),
one may directly modulate the ability of the cell to synthesize or to degrade this
compound, thereby modulating the yield and/or efficiency of production of the fine
chemical. Similarly, by modulating the activity of a protein which regulates a fine
chemical metabolic pathway, one may directly influence whether the production of the
desired compound is up- or down-regulated, either of which will modulate the yield or
efficiency of production of the fine chemical from the cell.

Indirect modulation of fine chemical production may also result by modifying
the activity of a protein of the invention (i.e., by mutagenesis of the corresponding gene)
such that the overall ability of the cell to grow and divide or to remain viable and
productive is increased. The production of fine chemicals from C. glutamicum is
generally accomplished by the large-scale fermentative culture of these microorganisms,
conditions which are frequently suboptimal for growth and cell division. By
engineering a protein of the invention (e.g., a stress response protein, a cell wall protein,
or proteins involved in the metabolism of compounds necessary for cell growth and
division to occur, such as nucleotides and amino acids) such that it is better able to
survive, grow, and multiply in such conditions, it may be possible to increase the
number and productivity of such engineered C. glutamicum cells in large-scale culture,
which in turn should result in increased yields and/or efficiency of production of one or
more desired fine chemicals. Further, the metabolic pathways of any cell are necessarily
interrelated and coregulated. By altering the activity or regulation of any one metabolic
pathway in C. glutamicum (i.e., by altering the activity of one of the proteins of the
invention which participates in such a pathway), it is possible to concomitantly alter the
activity or regulation of other metabolic pathways in this microorganism, which may be
directly involved in the synthesis or degradation of a fine chemical.

The aforementioned mutagenesis strategies for MCP proteins to result in
increased yields of a fine chemical from C. glutamicum are not meant to be limiting;
variations on these strategies will be readily apparent to one of ordinary skill in the art.
Using such strategies, and incorporating the mechanisms disclosed herein, the nucleic
acid and protein molecules of the invention may be utilized to generate C. glutamicum
or related strains of bacteria expressing mutated MCP nucleic acid and protein
molecules such that the yield, production, and/or efficiency of production of a desired
compound is improved. This desired compound may be any natural product of C.
glutamicum, which includes the final products of biosynthesis pathways and
intermediates of naturally-occurring metabolic pathways, as well as molecules which do
not naturally occur in the metabolism of C. glutamicum, but which are produced by a C.
glutamicum strain of the invention.


This invention is further illustrated by the following examples which should not 
be construed as limiting. The contents of all references, patent applications, patents, 
published patent applications, Tables, Appendices, and the sequence listing cited 
throughout this application are hereby incorporated by reference. 


Exemplification 

Example 1: Preparation of total genomic DNA of Corynebacterium glutamicum 
ATCC 13032 

A culture of Corynebacterium glutamicum (ATCC 13032) was grown overnight at 30°C with
vigorous shaking in BHI medium (Difco). The cells were harvested by centrifugation, the
supernatant was discarded and the cells were resuspended in 5 ml buffer-I (5% of the
original volume of the culture; all indicated volumes have been calculated for 100 ml of
culture volume). Composition of buffer-I: 140.34 g/l sucrose, 2.46 g/l MgSO4 x 7H2O,
10 ml/l KH2PO4 solution (100 g/l, adjusted to pH 6.7 with KOH), 50 ml/l M12 concentrate
(10 g/l (NH4)2SO4, 1 g/l NaCl, 2 g/l MgSO4 x 7H2O, 0.2 g/l CaCl2, 0.5 g/l yeast extract
(Difco), 10 ml/l trace-elements-mix (200 mg/l FeSO4 x H2O, 10 mg/l ZnSO4 x 7H2O, 3 mg/l
MnCl2 x 4H2O, 30 mg/l H3BO3, 20 mg/l CoCl2 x 6H2O, 1 mg/l NiCl2 x 6H2O, 3 mg/l
Na2MoO4 x 2H2O, 500 mg/l complexing agent (EDTA or citric acid), 100 ml/l vitamins-mix
(0.2 mg/l biotin, 0.2 mg/l folic acid, 20 mg/l p-amino benzoic acid, 20 mg/l riboflavin,
40 mg/l Ca-pantothenate, 140 mg/l nicotinic acid, 40 mg/l pyridoxole hydrochloride,
200 mg/l myo-inositol). Lysozyme was added to the suspension to a final concentration of
2.5 mg/ml. After an approximately 4 h incubation at 37°C, the cell wall was degraded and
the resulting protoplasts were harvested by centrifugation. The pellet was washed once
with 5 ml buffer-I and once with 5 ml TE-buffer (10 mM Tris-HCl, 1 mM EDTA, pH 8). The
pellet was resuspended in 4 ml TE-buffer, and 0.5 ml SDS solution (10%) and 0.5 ml NaCl
solution (5 M) were added. After addition of proteinase K to a final concentration of
200 µg/ml, the suspension was incubated for ca. 18 h at 37°C. The DNA was purified by
extraction with phenol, phenol-chloroform-isoamylalcohol and chloroform-isoamylalcohol
using standard procedures. Then, the DNA was precipitated by adding 1/50 volume of 3 M
sodium acetate and 2 volumes of ethanol, followed by a 30 min incubation at -20°C and a
30 min centrifugation at 12,000 rpm in a high speed centrifuge using a SS34 rotor
(Sorvall). The DNA was dissolved in 1 ml TE-buffer containing 20 µg/ml RNaseA and
dialysed at 4°C against 1000 ml TE-buffer for at least 3 hours. During this time, the
buffer was exchanged 3 times. To aliquots of 0.4 ml of the dialysed DNA solution, 0.4 ml
of 2 M LiCl and 0.8 ml of ethanol were added. After a 30 min incubation at -20°C, the DNA
was collected by centrifugation (13,000 rpm, Biofuge Fresco, Heraeus, Hanau, Germany).
The DNA pellet was dissolved in TE-buffer. DNA prepared by this procedure could be used
for all purposes, including Southern blotting or construction of genomic libraries.
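
Because all volumes in Example 1 are stated for 100 ml of culture, the amounts for another culture volume can be estimated by proportional scaling. The following Python sketch is illustrative only: it assumes that the reagent volumes scale linearly with culture volume, which the text implies but does not state, and the function and dictionary names are hypothetical.

```python
# Reagent volumes (ml) given in Example 1 for a 100 ml culture.
REFERENCE_CULTURE_ML = 100.0
REFERENCE_VOLUMES_ML = {
    "buffer-I (resuspension)": 5.0,
    "buffer-I (wash)": 5.0,
    "TE-buffer (wash)": 5.0,
    "TE-buffer (lysis)": 4.0,
    "SDS solution (10%)": 0.5,
    "NaCl solution (5 M)": 0.5,
}


def scale_volumes(culture_ml):
    """Scale the Example 1 volumes to another culture volume.

    Assumption (not stated in the text): volumes scale linearly.
    """
    if culture_ml <= 0:
        raise ValueError("culture volume must be positive")
    factor = culture_ml / REFERENCE_CULTURE_ML
    return {name: vol * factor for name, vol in REFERENCE_VOLUMES_ML.items()}


def precipitation_volumes(dna_solution_ml):
    """Volumes for the precipitation step: 1/50 volume of 3 M sodium
    acetate and 2 volumes of ethanol, relative to the DNA solution."""
    if dna_solution_ml <= 0:
        raise ValueError("DNA solution volume must be positive")
    return {
        "3 M sodium acetate": dna_solution_ml / 50.0,
        "ethanol": 2.0 * dna_solution_ml,
    }


if __name__ == "__main__":
    for name, vol in scale_volumes(250.0).items():
        print(f"{name}: {vol:.2f} ml")
    print(precipitation_volumes(5.0))
```

For a 250 ml culture this gives, for example, 12.5 ml of buffer-I for resuspension and 10 ml of TE-buffer for lysis.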


Example 2: Construction of genomic libraries in Escherichia coli of Corynebacterium 
glutamicum ATCC13032. 

Using DNA prepared as described in Example 1, cosmid and plasmid libraries were
constructed according to known and well established methods (see e.g., Sambrook, J. et al.
(1989) "Molecular Cloning: A Laboratory Manual", Cold Spring Harbor Laboratory Press,
or Ausubel, F.M. et al. (1994) "Current Protocols in Molecular Biology", John Wiley &
Sons).

Any plasmid or cosmid could be used. Of particular use were the plasmids pBR322
(Sutcliffe, J.G. (1979) Proc. Natl. Acad. Sci. USA, 75:3737-3741); pACYC177 (Chang &
Cohen (1978) J. Bacteriol. 134:1141-1156), plasmids of the pBS series (pBSSK+, pBSSK- and
others; Stratagene, LaJolla, USA), or cosmids such as SuperCos1 (Stratagene, LaJolla, USA) or
Lorist6 (Gibson, T.J., Rosenthal, A. and Waterson, R.H. (1987) Gene 53:283-286). Gene libraries
specifically for use in C. glutamicum may be constructed using plasmid pSL109 (Lee, H.-S. and
A. J. Sinskey (1994) J. Microbiol. Biotechnol. 4: 256-263).

Example 3: DNA Sequencing and Computational Functional Analysis

Genomic libraries as described in Example 2 were used for DNA sequencing
according to standard methods, using ABI377 sequencing machines (see e.g.,
Fleischmann, R.D. et al. (1995) "Whole-genome Random Sequencing and Assembly of
Haemophilus Influenzae Rd.", Science, 269:496-512). Sequencing primers with the
following nucleotide sequences were used: 5'-GGAAACAGTATGACCATG-3' or
5'-GTAAAACGACGGCCAGT-3'.
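
As an illustrative sanity check on the two primer sequences given above, the following Python sketch reports the length and GC content of each. It is not part of the described sequencing method; the dictionary keys and the function name are hypothetical labels chosen for this sketch.

```python
# The two sequencing primers listed in Example 3 (5' to 3').
# The keys are hypothetical labels, not names used in the text.
PRIMERS = {
    "primer_1": "GGAAACAGTATGACCATG",
    "primer_2": "GTAAAACGACGGCCAGT",
}


def primer_summary(sequence):
    """Return the length, GC count and GC fraction of a DNA primer."""
    seq = sequence.upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters: {sorted(unexpected)}")
    gc_count = seq.count("G") + seq.count("C")
    return {
        "length": len(seq),
        "gc_count": gc_count,
        "gc_fraction": gc_count / len(seq),
    }


if __name__ == "__main__":
    for label, seq in PRIMERS.items():
        info = primer_summary(seq)
        print(label, seq, info["length"], f"{info['gc_fraction']:.2f}")
```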

Example 4: In vivo Mutagenesis

In vivo mutagenesis of Corynebacterium glutamicum can be performed by passage of 
plasmid (or other vector) DNA through E. coli or other microorganisms (e.g. Bacillus spp. or 
yeasts such as Saccharomyces cerevisiae) which are impaired in their capabilities to maintain 
the integrity of their genetic information. Typical mutator strains have mutations in the genes 
for the DNA repair system (e.g., mutHLS, mutD, mutT, etc.; for reference, see Rupp, W.D.
(1996) DNA repair mechanisms, in: Escherichia coli and Salmonella, p. 2277-2294, ASM: 
Washington.) Such strains are well known to those of ordinary skill in the art. The use of such 
strains is illustrated, for example, in Greener, A. and Callahan, M. (1994) Strategies 7: 32-34. 

Example 5: DNA Transfer Between Escherichia coli and Corynebacterium
glutamicum 

Several Corynebacterium and Brevibacterium species contain endogenous
plasmids (such as, e.g., pHM1519 or pBL1) which replicate autonomously (for review see,
e.g., Martin, J.F. et al. (1987) Biotechnology, 5:137-146). Shuttle vectors for Escherichia
coli and Corynebacterium glutamicum can be readily constructed by using standard vectors for
E. coli (Sambrook, J. et al. (1989), "Molecular Cloning: A Laboratory Manual", Cold
Spring Harbor Laboratory Press or Ausubel, F.M. et al. (1994) "Current Protocols in
Molecular Biology", John Wiley & Sons) to which an origin of replication for and a
suitable marker from Corynebacterium glutamicum are added. Such origins of replication
are preferably taken from endogenous plasmids isolated from Corynebacterium and
Brevibacterium species. Of particular use as transformation markers for these species are
genes for kanamycin resistance (such as those derived from the Tn5 or Tn903
transposons) or chloramphenicol resistance (Winnacker, E.L. (1987) "From Genes to Clones -
Introduction to Gene Technology", VCH, Weinheim). There are numerous examples in the
literature of the construction of a wide variety of shuttle vectors which replicate in both E.
coli and C. glutamicum, and which can be used for several purposes, including gene
overexpression (for reference, see e.g., Yoshihama, M. et al. (1985) J. Bacteriol. 162:591-597,
Martin, J.F. et al. (1987) Biotechnology, 5:137-146 and Eikmanns, B.J. et al. (1991) Gene,
102:93-98).

Using standard methods, it is possible to clone a gene of interest into one of the
shuttle vectors described above and to introduce such hybrid vectors into strains of
Corynebacterium glutamicum. Transformation of C. glutamicum can be achieved by
protoplast transformation (Katsumata, R. et al. (1984) J. Bacteriol. 159:306-311),
electroporation (Liebl, E. et al. (1989) FEMS Microbiol. Letters, 53:399-303) and, in cases
where special vectors are used, also by conjugation (as described e.g. in Schafer, A. et al.
(1990) J. Bacteriol. 172:1663-1666). It is also possible to transfer the shuttle vectors for
C. glutamicum to E. coli by preparing plasmid DNA from C. glutamicum (using standard
methods well-known in the art) and transforming it into E. coli. This transformation step
can be performed using standard methods, but it is advantageous to use an Mcr-deficient
E. coli strain, such as NM522 (Gough & Murray (1983) J. Mol. Biol. 166:1-19).

Genes may be overexpressed in C. glutamicum strains using plasmids which
comprise pCG1 (U.S. Patent No. 4,617,267) or fragments thereof, and optionally the
gene for kanamycin resistance from Tn903 (Grindley, N.D. and Joyce, C.M. (1980)
Proc. Natl. Acad. Sci. USA 77(12): 7176-7180). In addition, genes may be
overexpressed in C. glutamicum strains using plasmid pSL109 (Lee, H.-S. and A. J.
Sinskey (1994) J. Microbiol. Biotechnol. 4: 256-263).

Aside from the use of replicative plasmids, gene overexpression can also be
achieved by integration into the genome. Genomic integration in C. glutamicum or other
Corynebacterium or Brevibacterium species may be accomplished by well-known
methods, such as homologous recombination with genomic region(s), restriction
endonuclease mediated integration (REMI) (see, e.g., DE Patent 19823834), or through
the use of transposons. It is also possible to modulate the activity of a gene of interest by
modifying the regulatory regions (e.g., a promoter, a repressor, and/or an enhancer) by
sequence modification, insertion, or deletion using site-directed methods (such as
homologous recombination) or methods based on random events (such as transposon
mutagenesis or REMI). Nucleic acid sequences which function as transcriptional
terminators may also be inserted 3' to the coding region of one or more genes of the
invention; such terminators are well-known in the art and are described, for example, in
Winnacker, E.L. (1987) From Genes to Clones - Introduction to Gene Technology. VCH:
Weinheim.

Example 6: Assessment of the Expression of the Mutant Protein

Observations of the activity of a mutated protein in a transformed host cell rely on
[...] to that of the wild-type protein. A useful method to ascertain the level of transcription of
the mutant gene (an indicator of the amount of mRNA available for translation to the gene
product) is to perform a Northern blot (for reference see, for example, Ausubel et al.
(1988) Current Protocols in Molecular Biology, Wiley: New York), in which a probe
designed to bind to the gene of interest is labeled with a detectable tag (usually radioactive
or chemiluminescent), such that when the total RNA of a culture of the organism is
extracted, run on a gel, transferred to a stable matrix and incubated with this probe, the
binding and quantity of binding of the probe indicates the presence and also the quantity
of mRNA for this gene. This information is evidence of the degree of transcription of the
mutant gene. Total cellular RNA can be prepared from Corynebacterium glutamicum by
several methods, all well-known in the art, such as that described in Bormann, E.R. et al.
(1992) Mol. Microbiol. 6: 317-326.

To assess the presence or relative quantity of protein translated from this mRNA,
standard techniques, such as a Western blot, may be employed (see, for example, Ausubel
et al. (1988) Current Protocols in Molecular Biology, Wiley: New York). In this process,
total cellular proteins are extracted, separated by gel electrophoresis, transferred to a
matrix such as nitrocellulose, and incubated with a probe, such as an antibody, which
specifically binds to the desired protein. This probe is generally tagged with a
chemiluminescent or colorimetric label which may be readily detected. The presence and
quantity of label observed indicates the presence and quantity of the desired mutant
protein present in the cell.



Example 7: Growth of Genetically Modified Corynebacterium glutamicum - Media
and Culture Conditions

Genetically modified Corynebacteria are cultured in synthetic or natural growth
media. A number of different growth media for Corynebacteria are both well-known and
readily available (Liebl et al. (1989) Appl. Microbiol. Biotechnol., 32:205-210; von der
Osten et al. (1998) Biotechnology Letters, 11:11-16; Patent DE 4,120,867; Liebl (1992)
"The Genus Corynebacterium", in: The Procaryotes, Volume II, Balows, A. et al., eds.
Springer-Verlag). These media consist of one or more carbon sources, nitrogen sources,
inorganic salts, vitamins and trace elements. Preferred carbon sources are sugars, such as
mono-, di-, or polysaccharides. For example, glucose, fructose, mannose, galactose,
ribose, sorbose, ribulose, lactose, maltose, sucrose, raffinose, starch or cellulose serve as
very good carbon sources. It is also possible to supply sugar to the media via complex
compounds such as molasses or other by-products from sugar refinement. It can also be
advantageous to supply mixtures of different carbon sources. Other possible carbon
sources are alcohols and organic acids, such as methanol, ethanol, acetic acid or lactic
acid. Nitrogen sources are usually organic or inorganic nitrogen compounds, or materials
which contain these compounds. Exemplary nitrogen sources include ammonia gas or
ammonia salts [...] complex nitrogen sources like corn steep liquor, soy bean flour, soy
bean protein, yeast extract, meat extract and others.

Inorganic salt compounds which may be included in the media include the 
chloride-, phosphorous- or sulfate- salts of calcium, magnesium, sodium, cobalt, 
molybdenum, potassium, manganese, zinc, copper and iron. Chelating compounds can be 
added to the medium to keep the metal ions in solution. Particularly useful chelating 
compounds include dihydroxyphenols, like catechol or protocatechuate, or organic acids, 
such as citric acid. It is typical for the media to also contain other growth factors, such as 
vitamins or growth promoters, examples of which include biotin, riboflavin, thiamin, folic 
acid, nicotinic acid, pantothenate and pyridoxin. Growth factors and salts frequently 
originate from complex media components such as yeast extract, molasses, corn steep 
liquor and others. The exact composition of the media compounds depends strongly on 
the immediate experiment and is individually decided for each specific case. Information 
about media optimization is available in the textbook "Applied Microbial Physiology, A
Practical Approach" (eds. P.M. Rhodes, P.F. Stanbury, IRL Press (1997) pp. 53-73, ISBN 0
19 963577 3). It is also possible to select growth media from commercial suppliers, like
standard 1 (Merck) or BHI (brain heart infusion, DIFCO) or others.

All medium components are sterilized, either by heat (20 minutes at 1.5 bar and 
121°C) or by sterile filtration. The components can either be sterilized together or, if 
necessary, separately. All media components can be present at the beginning of growth, 
or they can optionally be added continuously or batchwise. 

Culture conditions are defined separately for each experiment. The temperature
should be in a range between 15°C and 45°C. The temperature can be kept constant or can
be altered during the experiment. The pH of the medium should be in the range of 5 to
8.5, preferably around 7.0, and can be maintained by the addition of buffers to the media.
An exemplary buffer for this purpose is a potassium phosphate buffer. Synthetic buffers
such as MOPS, HEPES, ACES and others can alternatively or simultaneously be used. It
is also possible to maintain a constant culture pH through the addition of NaOH or
NH4OH during growth. If complex medium components such as yeast extract are utilized,
the necessity for additional buffers may be reduced, due to the fact that many complex
compounds have high buffer capacities. If a fermentor is utilized for culturing the
microorganisms, [...]

The incubation time is usually in a range from several hours to several days. This
time is selected in order to permit the maximal amount of product to accumulate in the
broth. The disclosed growth experiments can be carried out in a variety of vessels, such as
microtiter plates, glass [...]

For screening a large number of clones, the microorganisms should be cultured in 
microtiter plates, glass tubes or shake flasks, either with or without baffles. Preferably 
100 ml shake flasks are used, filled with 10% (by volume) of the required growth 
medium. The flasks should be shaken on a rotary shaker (amplitude 25 mm) using a 
speed-range of 100 - 300 rpm. Evaporation losses can be diminished by the maintenance
of a humid atmosphere; alternatively, a mathematical correction for evaporation losses
should be performed.
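
The text does not specify how the mathematical correction for evaporation losses is to be made. One simple possibility, shown here only as a hedged Python sketch, assumes that water alone evaporates while the amount of dissolved product stays constant, so that a concentration measured in the reduced volume can be rescaled to the initial volume. The function names and the example numbers are hypothetical.

```python
def medium_volume_ml(flask_volume_ml=100.0, fill_fraction=0.10):
    """Medium volume for a shake flask filled to 10% (by volume),
    as preferred in the text for 100 ml flasks."""
    return flask_volume_ml * fill_fraction


def correct_for_evaporation(measured_conc, initial_volume_ml, final_volume_ml):
    """Rescale a concentration measured after evaporation to the
    initial culture volume.

    Assumption (not from the text): only water is lost, so
    amount = measured_conc * final_volume stays constant.
    """
    if initial_volume_ml <= 0 or final_volume_ml <= 0:
        raise ValueError("volumes must be positive")
    if final_volume_ml > initial_volume_ml:
        raise ValueError("final volume exceeds initial volume")
    return measured_conc * final_volume_ml / initial_volume_ml


if __name__ == "__main__":
    v0 = medium_volume_ml()  # 10 ml of medium in a 100 ml flask
    # Hypothetical numbers: 5.0 g/l measured after the volume fell to 9 ml.
    print(correct_for_evaporation(5.0, v0, 9.0))
```

The remaining volume could, for instance, be estimated by weighing the flask before and after incubation; this too is an assumption rather than a procedure given in the text.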

If genetically modified clones are tested, an unmodified control clone or a control
clone containing the basic plasmid without any insert should also be tested. The medium
is inoculated to an OD600 of 0.5 - 1.5 using cells grown on agar plates, such as CM plates
(10 g/l glucose, 2.5 g/l NaCl, 2 g/l urea, 10 g/l polypeptone, 5 g/l yeast extract, 5 g/l meat
extract, 22 g/l agar, pH 6.8 with 2 M NaOH) that had been incubated at 30°C. Inoculation of the
media is accomplished by either introduction of a saline suspension of C. glutamicum cells
from CM plates or addition of a liquid preculture of this bacterium.
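
The volume of cell suspension or preculture needed to reach the stated starting OD600 of 0.5 - 1.5 can be estimated from a simple dilution relation. The Python sketch below is illustrative: it assumes that OD600 is proportional to cell density over the range used, and the function and parameter names are hypothetical.

```python
def inoculum_volume_ml(target_od, final_volume_ml, preculture_od):
    """Volume of preculture (or saline cell suspension) to add so that
    the inoculated medium has the target OD600.

    Assumption (not from the text): OD600 is proportional to cell
    density, so target_od * final_volume = preculture_od * inoculum.
    final_volume_ml is the total volume after inoculation.
    """
    if not 0.5 <= target_od <= 1.5:
        raise ValueError("target OD600 outside the 0.5 - 1.5 range of Example 7")
    if preculture_od <= target_od:
        raise ValueError("preculture must be denser than the target OD600")
    return target_od * final_volume_ml / preculture_od


if __name__ == "__main__":
    # Hypothetical case: 10 ml final volume, preculture at OD600 = 20.
    print(inoculum_volume_ml(1.0, 10.0, 20.0))
```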

Example 8: In vitro Analysis of the Function of Mutant Proteins

The determination of activities and kinetic parameters of enzymes is well 
established in the art. Experiments to determine the activity of any given altered 
enzyme must be tailored to the specific activity of the wild-type enzyme, which is well 
within the ability of one of ordinary skill in the art. Overviews about enzymes in 
general, as well as specific details concerning structure, kinetics, principles, methods, 
applications and examples for the determination of many enzyme activities may be 
found, for example, in the following references: Dixon, M., and Webb, E.C., (1979) 
Enzymes. Longmans: London; Fersht, (1985) Enzyme Structure and Mechanism. 
Freeman: New York; Walsh, (1979) Enzymatic Reaction Mechanisms. Freeman: San 
Francisco; Price, N.C., Stevens, L. (1982) Fundamentals of Enzymology. Oxford Univ. 
Press: Oxford; Boyer, P.D., ed. (1983) The Enzymes, 3rd ed. Academic Press: New
York; Bisswanger, H., (1994) Enzymkinetik, 2nd ed. VCH: Weinheim (ISBN
3527300325); Bergmeyer, H.U., Bergmeyer, J., Graßl, M., eds. (1983-1986) Methods of
Enzymatic Analysis, 3rd ed., vol. I-XII, Verlag Chemie: Weinheim; and Ullmann's [...]

The activity of proteins which bind to DNA can be measured by several well-established
methods, such as DNA band-shift assays (also called gel retardation assays).
The effect of such proteins on the expression of other molecules can be measured using
reporter gene assays (such as that described in [...] EMBO J. 14:
3895-3904 and references cited therein). Reporter gene test systems are well known and
established for applications in both pro- and eukaryotic cells, using enzymes such as
beta-galactosidase, green fluorescent protein, and several others.

The determination of activity of membrane-transport proteins can be performed
according to techniques such as those described in Gennis, R.B. (1989) "Pores,
Channels and Transporters", in Biomembranes, Molecular Structure and Function,
Springer: Heidelberg, p. 85-137; 199-234; and 270-322.

Example 9: Analysis of Impact of Mutant Protein on the Production of the Desired 
Product 

The effect of the genetic modification in C. glutamicum on production of a
desired compound (such as an amino acid) can be assessed by growing the modified
microorganism under suitable conditions (such as those described above) and analyzing
the medium and/or the cellular component for increased production of the desired
product (i.e., an amino acid). Such analysis techniques are well known to one of
ordinary skill in the art, and include spectroscopy, thin layer chromatography, staining
methods of various kinds, enzymatic and microbiological methods, and analytical
chromatography such as high performance liquid chromatography (see, for example,
Ullmann, Encyclopedia of Industrial Chemistry, vol. A2, p. 89-90 and p. 443-613, VCH:
Weinheim (1985); Fallon, A. et al., (1987) "Applications of HPLC in Biochemistry" in:
Laboratory Techniques in Biochemistry and Molecular Biology, vol. 17; Rehm et al.
(1993) Biotechnology, vol. 3, Chapter III: "Product recovery and purification", page
469-714, VCH: Weinheim; Belter, P.A. et al. (1988) Bioseparations: downstream
processing for biotechnology, John Wiley and Sons; Kennedy, J.F. and Cabral, J.M.S.
(1992) Recovery processes for biological materials, John Wiley and Sons; Shaeiwitz,
J.A. and Henry, J.D. (1988) Biochemical separations, in: Ullmann's Encyclopedia of
Industrial Chemistry, vol. B3, Chapter 11, page 1-27, VCH: Weinheim; and Dechow,
F.J. (1989) Separation and purification techniques in biotechnology, Noyes
Publications.)

In addition to the measurement of the final product of fermentation, it is also
possible to analyze other components of the metabolic pathways utilized for the
production of the desired compound, such as intermediates and side-products, to
determine the overall efficiency of production of the compound. Analysis methods
include measurements of nutrient levels in the medium (e.g., sugars, hydrocarbons,
nitrogen sources, phosphate, and other ions), measurements of biomass composition and
growth, analysis of the production of common metabolites of biosynthetic pathways, and
measurement of gasses produced during fermentation. Standard methods for these
measurements are outlined in Applied Microbial Physiology, A Practical Approach,
P.M. Rhodes and P.F. Stanbury, eds., IRL Press, p. 103-129; 131-163; and 165-192
(ISBN: 0199635773) and references cited therein.

Example 10: Purification of the Desired Product from C. glutamicum Culture

Recovery of the desired product from the C. glutamicum cells or supernatant of
the above-described culture can be performed by various methods well known in the art.
If the desired product is not secreted from the cells, the cells can be harvested from the
culture by low-speed centrifugation and lysed by standard techniques, such
as mechanical force or sonication. The cellular debris is removed by centrifugation, and
the supernatant fraction containing the soluble proteins is retained for further
purification of the desired compound. If the product is secreted from the C. glutamicum
cells, then the cells are removed from the culture by low-speed centrifugation, and the
supernatant fraction is retained for further purification.

The supernatant fraction from either purification method is subjected to
chromatography with a suitable resin, in which the desired molecule is either retained on
a chromatography resin while many of the impurities in the sample are not, or where the
impurities are retained by the resin while the sample is not. Such chromatography steps
may be repeated as necessary, using the same or different chromatography resins. One
of ordinary skill in the art would be well-versed in the selection of appropriate
chromatography resins and in their most efficacious application for a particular molecule
to be purified. The purified product may be concentrated by filtration or ultrafiltration,
and stored at a temperature at which the stability of the product is maximized.

There is a wide array of purification methods known to the art and the
preceding method of purification is not meant to be limiting. Such purification
techniques are described, for example, in Bailey, J.E. & Ollis, D.F. Biochemical
Engineering Fundamentals, McGraw-Hill: New York (1986).

The identity and purity of the isolated compounds may be assessed by techniques
standard in the art. These include high-performance liquid chromatography (HPLC),
spectroscopic methods, staining methods, thin layer chromatography, NIRS, enzymatic
assay, or microbiological methods. Such analysis methods are reviewed in: Patek et al. (1994)
Appl. Environ. Microbiol. 60: 133-140; Malakhova et al. (1996) Biotekhnologiya 11: 27-
[...] of Industrial Chemistry, (1996) vol. A27, VCH: Weinheim, p. 89-90, p. 521-540, p. 540-
547, p. 559-566, 575-581 and p. 581-587; Michal, G. (1999) Biochemical Pathways: An
Atlas of Biochemistry and Molecular Biology, John Wiley and Sons; Fallon, A. et al.
(1987) Applications of HPLC in Biochemistry in: Laboratory Techniques in
Biochemistry and Molecular Biology, vol. 17.

Example 11: Analysis of the Gene Sequences of the Invention 

The comparison of sequences and determination of percent homology between
two sequences are art-known techniques, and can be accomplished using a mathematical
algorithm, such as the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad. Sci.
USA 87:2264-68, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA
90:5873-77. Such an algorithm is incorporated into the NBLAST and XBLAST
programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST
nucleotide searches can be performed with the NBLAST program, score = 100,
wordlength = 12 to obtain nucleotide sequences homologous to MCP nucleic acid
molecules of the invention. BLAST protein searches can be performed with the
XBLAST program, score = 50, wordlength = 3 to obtain amino acid sequences
homologous to MCP protein molecules of the invention. To obtain gapped alignments
for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al.,
(1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped
BLAST programs, one of ordinary skill in the art will know how to optimize the
parameters of the program (e.g., XBLAST and NBLAST) for the specific sequence
being analyzed.

Another example of a mathematical algorithm utilized for the comparison of
sequences is the algorithm of Meyers and Miller ((1988) Comput. Appl. Biosci. 4:11-
17). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is
part of the GCG sequence alignment software package. When utilizing the ALIGN
program for comparing amino acid sequences, a PAM120 weight residue table, a gap
length penalty of 12, and a gap penalty of 4 can be used. Additional algorithms for
sequence analysis are known in the art, and include ADVANCE and ADAM, described
in Torelli and Robotti (1994) Comput. Appl. Biosci. 10:3-5; and FASTA, described in
Pearson and Lipman (1988) P.N.A.S. 85:2444-8.

The percent homology between two amino acid sequences can also be
determined using the GAP program in the GCG software package (available at
http://www.gcg.com), using either a Blosum 62 matrix or a PAM250 matrix, and a gap
weight of 12, 10, 8, 6, or 4 and a length weight of 2, 3, or 4. The percent homology
between two nucleic acid sequences can be determined using the GAP program in the
GCG software package, using standard parameters, such as a gap weight of 50 and a
length weight of 3.
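
To illustrate what a percentage figure derived from a pairwise alignment expresses, the following Python sketch computes the percent identity of two sequences that have already been aligned. It is a simplification and not the GAP program: it performs no alignment, applies no scoring matrix, gap weight or length weight, and counts only identical residues over all alignment columns. The function name and the gap character are assumptions of this sketch.

```python
def percent_identity(aligned_a, aligned_b, gap_char="-"):
    """Percent of alignment columns in which both sequences carry the
    same residue. Inputs must already be aligned (equal length, gaps
    written as gap_char). Columns containing a gap never count as a
    match but do count toward the total number of columns."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    matches = sum(
        1
        for x, y in zip(aligned_a, aligned_b)
        if x == y and x != gap_char
    )
    return 100.0 * matches / len(aligned_a)


if __name__ == "__main__":
    # Hypothetical six-column alignment with one mismatch and one gap.
    print(round(percent_identity("ACGT-A", "ACCTTA"), 2))
```

Other denominators (for example, the length of the shorter sequence, or only the ungapped columns) are also in use, so a figure computed this way is not necessarily comparable with values reported by a particular alignment program.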

A comparative analysis of the gene sequences of the invention with those present
in Genbank has been performed using techniques known in the art (see, e.g., Baxevanis
and Ouellette, eds. (1998) Bioinformatics: A Practical Guide to the Analysis of Genes
and Proteins. John Wiley and Sons: New York). The gene sequences of the invention
were compared to genes present in Genbank in a three-step process. In a first step, a
BLASTN analysis (e.g., a local alignment analysis) was performed for each of the
sequences of the invention against the nucleotide sequences present in Genbank, and the
top 500 hits were retained for further analysis. A subsequent FASTA search (e.g., a
combined local and global alignment analysis, in which limited regions of the sequences
are aligned) was performed on these 500 hits. Each gene sequence of the invention was
subsequently globally aligned to each of the top three FASTA hits, using the GAP
program in the GCG software package (using standard parameters). In order to obtain
correct results, the length of the sequences extracted from Genbank was adjusted to the
length of the query sequences by methods well-known in the art. The results of this
analysis are set forth in Table 4. The resulting data is identical to that which would have
been obtained had a GAP (global) analysis alone been performed on each of the genes of
the invention in comparison with each of the references in Genbank, but required
significantly reduced computational time as compared to such a database-wide GAP
(global) analysis. Sequences of the invention for which no alignments above the cutoff
values were obtained are indicated on Table 4 by the absence of alignment information.
It will further be understood by one of ordinary skill in the art that the GAP alignment
homology percentages set forth in Table 4 under the heading "% homology (GAP)" are
listed in the European numerical format, wherein a ',' represents a decimal point. For
example, a value of "40,345" in this column represents "40.345%".
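
When values from the "% homology (GAP)" column are processed by software, the European decimal comma has to be converted before the value can be read as a number. A minimal Python sketch follows; the function name is hypothetical, and it assumes that the column contains no thousands separators.

```python
def parse_european_percentage(text):
    """Convert a European-format value such as "40,345" (optionally
    followed by a percent sign) into a float, here 40.345.

    Assumption: the comma is always a decimal separator and no
    thousands separators are present.
    """
    cleaned = text.strip().rstrip("%").strip()
    if cleaned.count(",") > 1:
        raise ValueError(f"ambiguous value: {text!r}")
    return float(cleaned.replace(",", "."))


if __name__ == "__main__":
    print(parse_european_percentage("40,345"))
```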

Example 12: Construction and Operation of DNA Microarrays 

The sequences of the invention may additionally be used in the construction and
application of DNA microarrays (the design, methodology, and uses of DNA arrays are
well known in the art, and are described, for example, in Schena, M. et al. (1995)
Science 270: 467-470; Wodicka, L. et al. (1997) Nature Biotechnology 15: 1359-1367;
DeSaizieu, A. et al. (1998) Nature Biotechnology 16: 45-48; and DeRisi, J.L. et al.
(1997) Science 278: 680-686).

DNA microarrays are solid or flexible supports consisting of nitrocellulose,
nylon, glass, silicone, or other materials. Nucleic acid molecules may be attached to the
[...] nucleic acid mixtures can be hybridized to the immobilized nucleic acid molecules,
and the label may be used to monitor and measure the individual signal intensities of the
hybridized molecules at defined regions. This methodology allows the simultaneous
quantification of the relative or absolute [...] nucleic acid sample or mixture. DNA
microarrays, therefore, permit an analysis of the expression of multiple (as many as 6800
or more) nucleic acids in parallel (see, e.g., Schena, M. (1996) BioEssays 18(5): 427-431).

The sequences of the invention may be used to design oligonucleotide primers
which are able to amplify defined regions of one or more C. glutamicum genes by a
nucleic acid amplification reaction such as the polymerase chain reaction. The choice
and design of the 5' or 3' oligonucleotide primers or of appropriate linkers allows the
covalent attachment of the resulting PCR products to the surface of a support medium
described above (and also described, for example, in Schena, M. et al. (1995) Science 270:
467-470).

Nucleic acid microarrays may also be constructed by in situ oligonucleotide
synthesis as described by Wodicka, L. et al. (1997) Nature Biotechnology 15: 1359-1367.
By photolithographic methods, precisely defined regions of the matrix are
exposed to light. Protective groups which are photolabile are thereby activated and
undergo nucleotide addition, whereas regions that are masked from light do not undergo
any modification. Subsequent cycles of protection and light activation permit the
synthesis of different oligonucleotides at defined positions. Small, defined regions of
the genes of the invention may be synthesized on microarrays by solid phase
oligonucleotide synthesis.
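
The cycle of masking and light activation can be pictured with a small model. The sketch below is purely illustrative: it treats each cycle as a nucleotide plus the set of array positions exposed to light, and it ignores the chemistry of protection and deprotection entirely. The function name and the example cycles are hypothetical.

```python
def synthesize(n_positions, cycles):
    """Simulate mask-directed in situ synthesis on an array.

    `cycles` is a sequence of (base, exposed_positions) pairs. In each cycle
    only the exposed positions are extended by `base`; masked positions are
    left unchanged.
    """
    oligos = [""] * n_positions
    for base, exposed in cycles:
        for pos in exposed:
            oligos[pos] += base
    return oligos

# Four cycles on a three-position array, each with a different mask.
cycles = [
    ("A", {0, 1}),
    ("C", {1, 2}),
    ("G", {0, 2}),
    ("T", {0, 1, 2}),
]
oligos = synthesize(3, cycles)
```

Because each position sees a different subset of cycles, different oligonucleotides accumulate at different defined positions.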

The nucleic acid molecules of the invention present in a sample or mixture of
nucleotides may be hybridized to the microarrays. These nucleic acid molecules can be
labeled according to standard methods. In brief, nucleic acid molecules (e.g., mRNA
molecules or DNA molecules) are labeled by the incorporation of isotopically or
fluorescently labeled nucleotides, e.g., during reverse transcription or DNA synthesis.
Hybridization of labeled nucleic acids to microarrays is described (e.g., in Schena, M. et
al. (1995) supra; Wodicka, L. et al. (1997), supra; and DeSaizieu A. et al. (1998),
supra). The detection and quantification of the hybridized molecule are tailored to the
specific incorporated label. Radioactive labels can be detected, for example, as
described in Schena, M. et al. (1995) supra, and fluorescent labels may be detected, for
example, by the method of Shalon et al. (1996) Genome Research 6: 639-645.

The application of the sequences of the invention to DNA microarray
technology, as described above, permits comparative analyses of different strains of C.
glutamicum or other Corynebacteria. For example, studies of inter-strain variations
based on individual transcript profiles and the identification of genes that are important
for specific and/or desired strain properties such as pathogenicity, productivity and 
stress tolerance are facilitated by nucleic acid array methodologies. Also, comparisons
of the profile of expression of genes of the invention during the course of a fermentation
reaction are possible using nucleic acid array technology.

Example 13: Analysis of the Dynamics of Cellular Protein Populations 
(Proteomics) 

The genes, compositions, and methods of the invention may be applied to study
the interactions and dynamics of populations of proteins, termed 'proteomics'. Protein
populations of interest include, but are not limited to, the total protein population of C.
glutamicum (e.g., in comparison with the protein populations of other organisms), those
proteins which are active under specific environmental or metabolic conditions (e.g.,
during fermentation, at high or low temperature, or at high or low pH), or those proteins
which are active during specific phases of growth and development.

Protein populations can be analyzed by various well-known techniques, such as
gel electrophoresis. Cellular proteins may be obtained, for example, by lysis or
extraction, and may be separated from one another using a variety of electrophoretic
techniques. Sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE)
separates proteins largely on the basis of their molecular weight. Isoelectric focusing
polyacrylamide gel electrophoresis (IEF-PAGE) separates proteins by their isoelectric
point (which reflects not only the amino acid sequence but also posttranslational
modifications of the protein). Another, more preferred method of protein analysis is the
consecutive combination of both IEF-PAGE and SDS-PAGE, known as 2-D-gel
electrophoresis (described, for example, in Hermann et al. (1998) Electrophoresis 19:
3217-3221; Fountoulakis et al. (1998) Electrophoresis 19: 1193-1202; Langen et al.
(1997) Electrophoresis 18: 1184-1192; Antelmann et al. (1997) Electrophoresis 18:
1451-1463). Other separation techniques may also be utilized for protein separation,
such as capillary gel electrophoresis; such techniques are well known in the art.
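
Because SDS-PAGE separates largely by molecular weight, a predicted molecular weight can help relate a protein sequence to a band or spot. The following is a minimal sketch, assuming standard average residue masses (rounded to four decimals) and an unmodified polypeptide; posttranslational modifications, which also shift the isoelectric point, are not considered. The function name is hypothetical.

```python
# Average residue masses in daltons (amino acid minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER = 18.0153  # one water is added back for the free N- and C-termini

def molecular_weight(protein):
    """Estimate the average molecular weight (Da) of an unmodified protein."""
    seq = protein.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    return sum(RESIDUE_MASS[aa] for aa in seq) + WATER

# Dipeptide glycyl-alanine as a small check case.
mw_ga = molecular_weight("GA")
```

Apparent molecular weights on a gel can deviate from such predictions, so the estimate is best treated as a guide to where a protein may migrate.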
Proteins separated by these methodologies can be visualized by standard
techniques, such as by staining or labeling. Suitable stains are known in the art, and
include Coomassie Brilliant Blue, silver stain, or fluorescent dyes such as Sypro Ruby 
(Molecular Probes). The inclusion of radioactively labeled amino acids or other protein 
precursors (e.g., 35S-methionine, 35S-cysteine, 14C-labelled amino acids, 15N-amino
acids, 15NO3 or 15NH4+ or 13C-labelled amino acids) in the medium of C. glutamicum
permits the labeling of proteins from these cells prior to their separation. Similarly,
fluorescent labels may be employed. These labeled proteins can be extracted, isolated
and separated according to the previously described techniques. 

Proteins visualized by these techniques can be further analyzed by measuring the
amount of dye or label used. The amount of a given protein can be determined
quantitatively using, for example, optical methods and can be compared to the amount
of other proteins in the same gel or in other gels. Comparisons of proteins on gels can
be made, for example, by optical comparison, by spectroscopy, by image scanning and
analysis of gels, or through the use of photographic films and screens. Such techniques
are well-known in the art.
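
One way such a comparison between gels might be carried out after image scanning is sketched below. It is an assumption-laden illustration: spot intensities are taken as already measured, each spot is normalized to the total intensity of its own gel so that differences in loading or staining are reduced, and the spot names and values are invented.

```python
def normalize_spots(spots):
    """Express each spot as a fraction of the total intensity of its gel."""
    total = sum(spots.values())
    if total <= 0:
        raise ValueError("total spot intensity must be positive")
    return {name: value / total for name, value in spots.items()}

def relative_change(gel_a, gel_b):
    """Ratio of normalized spot amounts (gel_b over gel_a) for shared spots."""
    a = normalize_spots(gel_a)
    b = normalize_spots(gel_b)
    return {name: b[name] / a[name] for name in a if name in b and a[name] > 0}

# Hypothetical spot intensities from two scanned gels.
gel_a = {"spot1": 50.0, "spot2": 150.0}
gel_b = {"spot1": 200.0, "spot2": 200.0}
changes = relative_change(gel_a, gel_b)
```

A ratio above 1 suggests that the protein makes up a larger share of the total in the second gel, and a ratio below 1 a smaller share.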

To determine the identity of any given protein, direct sequencing or other
standard techniques may be employed. For example, N- and/or C-terminal amino acid
sequencing (such as Edman degradation) may be used, as may mass spectrometry (in
particular MALDI or ESI techniques (see, e.g., Langen et al. (1997) Electrophoresis 18:
1184-1192)). The protein sequences provided herein can be used for the identification
of C. glutamicum proteins by these techniques.

The information obtained by these methods can be used to compare patterns of
protein presence, activity, or modification between different samples from various
biological conditions (e.g., different organisms, time points of fermentation, media
conditions, or different biotopes, among others). Data obtained from such experiments
alone, or in combination with other techniques, can be used for various applications,
such as to compare the behavior of various organisms in a given (e.g., metabolic)
situation, to increase the productivity of strains which produce fine chemicals or to
increase the efficiency of the production of fine chemicals.



Equivalents

Those of ordinary skill in the art will recognize, or will be able to ascertain using 
no more than routine experimentation, many equivalents to the specific embodiments of 
the invention described herein. Such equivalents are intended to be encompassed by the 
following claims.

1125
1127 



1018 FRXA02138 GR00639 

1020 RXN02151 W0300 

1022 FRXA02151 GR00639 

1024 RXN02169 W0100 

1026 F RXA02169 GR00641

1028 RXN02180 W0100 

1030 FRXA02180 GR00641 

1032 RXN02185 W0100 

1034 FRXA02185 GR00641 

1036 RXN02186 W0100 

1038 FRXA02186 GR00641 

1040 RXN02207 W0302 

1042 F RXA02207 GR00646 

1044 RXN02223 W0308 

1046 F RXA02223 GR00652 

1048 RXN02226 W0068 

1050 F RXA02226 GR00653 

1052 RXN02238 W0204 

1054 FRXA02238 GR00654 

1056 RXN02254 W0202 

1058 F RXA02254 GR00654 

1060 RXN02271 W0020 

1062 F RXA02271 GR00655

1064 RXN02279 W0020 

1066 F RXA02279 GR00657 

1068 RXN02296 W0127 

1070 F RXA02296 GR00662 

1072 RXN02300 W0127 

1074 F RXA02300 GR00662 

1076 RXN02301 W0127 

1078 F RXA02301 GR00662 

1080 RXN02302 W0127 

1082 F RXA02302 GR00662 

1084 RXN02303 W0127 

1086 F RXA02303 GR00663 

1088 RXN02307 W0127

1090 F RXA02307 GR00664 

1092 RXN02314 W0025 

1094 F RXA02314 GR00665 

1096 RXN02337 W0141 

1098 F RXA02337 GR00672 

1100 RXN02339 W0195 

1102 FRXA02339 GR00674 

1104 RXN02340 W0195 

1106 FRXA02338 GR00673 

1108 FRXA02340 GR00674 

1110 RXN02341 W0078 

1112 FRXA02341 GR00675 

1114 RXN02360 W0051 

1116 FRXA02360 GR00685 

1118 RXN02361 W0051 

1120 FRXA02361 GR00685 

1122 RXN02367 W0102 

1124 FRXA02367 GR00687 

1126 RXN02368 W0102 

1128 FRXA02368 GR00687 



4409 
19913 
19913 
3172 

-3172 

16813 

16813 

20185 

20185 

21192 

21213 

802 

10909 

2732 

425 

1059 

1059 

1345 

5241 

2 

21769 

14281 

5406 

236 

1 

24138 

6978 

28354 

11194 

29070 

11910 

29196 

12036 

13326 

1 

12611 

395 

23092 

6379 

4679 

2893 

1 

1 

1640 

484 

1214 

4279 

415 

14638 

3644 

12122 

6160 

4639 

2162 

3883 

2918 



4750 
21100 
21100 
4017 

_4Q17 

15356 
15356 
20763 
20763 
20995 
20995 
5 

11667 

3232 

6 

4 

4 

1629 
5525 

22449 

14838 

5963 

1693 

1404 

24626 

7466 

28022 

10862 

28354 

11194 

30074 

12800 

14231 

720 

11991 

6 

23532 
5939 
3357 
3816 

492 
576 
5 

576 

4764 

5 

12206 

6076 

11472 

6810 

5247 

1554 

4557 

2244 
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1131 


1132 


F RXA02381 


1133 


1134 


RXN02383 


1135 


1136 


F RXAQzobo 


1137 


1138 


RXNQ2387 






r- OV A AOOQ7 

— F-RXAO^-oo/ — 


1139 


1-140 




1141 


1142 


RXN02398 


1143 


1144 


F RXAQ2o9o 


1145 


1146 


RXN02406 


1147 


1148 


F RXA024UD 


1149 


1150 


nvuno A A "7 

RXN02407 


1151 


1152 


F RXAQ24U/ 


1153 


1154 


RXN02408 


1155 


1156 


F RXA02408 


1157 


1158 


RXN02409 


1159 


1160 


F RXA02409 


1161 


1162 


RXN02428 


1163 


1164 


F RXA02428 


1165 


1166 


RXN02454 


1167 


1168 


F RXA02454 


1169 


1170 


RXN02457 


1171 


1172 


F RXA02457 


1173 


1174 


RXN02460 


1175 


1176 


F RXA02460 


1177 


1178 


RXN02464 


1179 


1180 


F RXA02464 


1181 


1182 


RXN02465 


1183 


1184 


F RXA02465 


1185 


1186 


RXN02466 


1187 


1188 


F RXA02466 


1189 


1190 


RXN02505 


1191 


1192 


F RXA02505 


1193 


1194 


RXN02510 


1195 


1196 


F RXA02510 


1197 


1198 


RXN02519 


1199 


1200 


F RXA02519 


1201 


1202 


RXN02520 


1203 


1204 


F RXA02520 


1205 


1206 


RXN02534 


1207 


1208 


F RXA02534 


1209 


1210 


RXN02537 


1211 


1212 


r~ nv AAOC57 

F RXA02537 


1213 


1214 


RXN02538 


1215 


1216 


i— nv AftOCOQ 

F RXAUzooo 


1217 


1218 


RXN02000 


1219 


1220 


F RXA025OO 


1221 


1222 


RXN020D4 


1223 


1224 


C DYAH9^fi4 
r KA/MJZOO 4 * 


1225 


1226 


RXN02568 


1227 


1228 


F RXA02568 


1229 


1230 


RXN02593 


1231 


1232 


F RXA02593 


1233 


1234 


F RXA02594 


1235 


1236 


RXN02606 


1237 


1238 


F RXA02606 


1239 


1240 


RXN02610 
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Or\UUUC7 I 


1792 


770 


V VUl 1 O 


639 


4 


HR00692 


608 


6 


W0 176 


2729' 


3490 


npnriRQ4 


683 


6 


-VJ P\W W w*T 

V Vw I 1 w 


12750 


11149 




2841 


4370 


V vuuo*t 


22016 


22564 


GR00701 


1322 


774 


W0084 


21758 


21387 


nR0070i 


1580 


1885 


\A/nnft4 

V VUUOH 


20832 


19921 




832 


5 


\A/nOR4 
V VUUO*t 


21371 


20835 




1248 


835 


\A/m in 


4585 


3452 


f^Rnfi707 
Oi\UU I u / 


4585 


3452 


vvu iyo 


2810 


1569 


fiRn071 1 


3 


815 


W0 124 


19193 


18084 


r;Rnn7i9 

UrwUf \*u 


1295 


2404 


VVU l^n 


14649 


15152 


nR0n719 
Ur\UU/ \£ 


5839 


5336 


\A/noi 1 

VVUZ I I 


1990 


3189 


OtxUU/ IO 


1107 


1613 


\ A/091 1 
VVUZ I I 


3590 


3192 


nR0fl71 ^ 
VJr\UU/ IO 


2014 


1616 


\ A/091 1 


92 


6 


r^pnn7i a 

OrvUU/ l*f 


92 


6 


V VUUU / 


23969 


24139 


r:pnn79n 

Or\UU 1 tVJ 


18423 


18593 


\A/ni7i 

V VU I / I 


17467 


16832 


f^R0n791 


1983 


2618 


\A/H1ft? 
VVU I oo 


2709 


4337 




1933 


128 


\A/niR^ - 


2243 


.1560 


f^R00794 


2222 


2905 


\/\/nn^7 

V VUUO f 


11192 


11995 


GR00726 


5536 


6339 


W0057 

V V Uvw 1 


14617 


15078 


HR00726 


8961 


9422 


won^7 

V VUUO I 


15078 


15749 


GR00726 


9422 


10093 


W0101 


5340 


4738 


fiR00731 


1757 


1155 


W0 154 


10016 


9015 


GR00732 


2543 


3217 


W0245 


1657 


5 


GR00735 


1363 


5 


W0098 


11073 


11669 


GR00741 


18693 


18481 


GR00741 


19077 


18754 


W0098 


34557 


35927 


GR00742 


13514 


12144 


W0098 


31620 


30694 
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1245 


1246 


F RXAUzbz4 


1247 


1248 


RXN02bzb 


1249 


1250 


RXNUzbob 


1251 


1252 


DVMHOC71 

RXNUzb/o 






r~ OVA 


1253 


1254 




1255 


1256 


RXNUzboU 


1257 


1258 


F RXAUzboU 


1259 


1260 


r- OVA AOC7Q 

F RAAUzb/y 


1261 


1262 


F RXAUzbol 


1263 


1264 


RXN02o9o 


1265 


1266 


i— o V A AOCOQ 

F RXAuzbyo 


1267 


1268 


RXNQzb9b 


1269 


1270 


F RXA0269b 


1271 


1272 


RXN02697 


1273 


1274 


F RXA02697 


1275 


1276 


r* O V AA071 C\ 

F RXA027iy 


1277 


1278 


RXN02720 


1279 


1280 


F RXA02720 


1281 


1282 


RXN02744 


1283 


1284 


F RXA02744 


1285 


1286 


RXN02770 


1287 


1288 


F RXA02770 


1289 


1290 


RXN02781 


1291 


1292 


•— r-> \y a A070 H 

F RXA02781 


1293 


1294 


RXN02782 


1295 


1296 


F RXA02782 


1297 


1298 


RXN02812 


1299 


1300 


F RXA02812 


1301 


1302 


RXN02817 


1303 


1304 


F RXA02817 


1305 


1306 


RXN02818 


1307 


1308 


F RXA02818 


1309 


1310 


RXN02825 


1311 


1312 


F RXA01322 


1313 


1314 


F RXA02824 


1315" " 


■ 1316 - 


i— o v a noooc 

F RXA02ozb 


1317 


1318 


RXNOzooo 


1319 


1320 


F RXAOzooo 


1321 


1322 


O V K 1 AO o a n 

RXN02840 


1323 


1324 


F RXAUzo4U 


1325 


1326 


RXNUzo41 


1327 


1328 


i— OVA HO Q A A 

F RXAUzo41 


1329 


1330 


RANUzo4b 


1331 


1332 


r RXAUzo4D 


1333 


A O O A 

1334 


RaNUzo4/ 


1335 


1336 


C OVA no Q /1 7 

F RXAUzo4/ 


1337 


1338 


RXNUzo4y 


1339 


1340 


F RXA02849 


1341 


1342 


RXN02911 


1343 


1344 


RXN02914 


1345 


1346 


RXN02921 


1347 


1348 


RXN02924 


1349 


1350 


RXN02927 


1351 


1352 


RXN02928 


1353 


1354 


RXN02931 


1355 


1356 


RXN02932 



vjrxuu # ho 


5602 


4889 


V V vO i " 


2012 


1008 


wnnQn 

V V UUv/U 


15756 


14917 


V V UO 1 \J 


14030 


13398 


UlAUu / JJ — — - 


14030 


13398 


V V UUC7U 


64917 


66200 


^Rnn754 

vJlxUu f U*t 


6392 


5109 


VJrwU/ v)*t 


5268 


5693 


f^Rno754 


5751 


6194 


V V UU570 


74100 


75875 


Or\UU f uu 


1650 


4 


\A/nni7 

V VUU I f 


7946 


7491 


f^Rnn7 c ifi 

ot\UU 1 jO 


742 


287 


V VUU I / 


31257 


32783 


f^Rn07^7 


1 


699 




19598 


20245 


VVUU I I 


8727 


8026 


f^Rfln75Q 
Or\UU / J3 


631 


5 


V VUU / *t 


893 


1696 


Or\UU / uo 


14460 


13657 


\ A/0171 
V VU I i 1 


4202 


2637 


(^Rnn77° 

O r\uU lie. 


3 


1322 


V VUUOH 


14566 


13376 


Or\UU / / H 


1345 


155 


v vuuyo 


7148 


8446 


r^Rnri77S 


204 


875 


V VUZ 1 u 


342 


4 


r^Rnn7Q^ 

vjr\UU f 


2 


568 


V VU0*fD 


403 


5 


^pnn7QR 
or\UU / yo 


403 


5 




611 


6 




611 
\j i i 


6 


V VUUOZ 


3589 


1751 


VJixUUOOu 


443 
ii j 


6 


OrxuUOUJ 


531 


4 


OlxUUOUO 


565 


. .. .182. . 


wmfii 

V VU 1 U I 


1 






1 


462 


V V UOUJ 


488 


339 




488 


339 


VVUUjj 


11788 


12222 


Or\UUO*+U 


283 


5 


W0 197 

V VU It./ 


30861 


30112 




578 


6 


V VU 1 1 O 


47 


1135 


or\UUO*T / 


598 


5 


V V UZO I 


2 




\jr\wvU7w 


2 


283 


W0135 


24643 


25101 


W0127 


17305 


16763 


W0213 


1871 


1401 


W0088 


4557 


5105 


W0082 


18836 


19303 


W0082 


19511 


20203 


W0090 


25420 


25644 


W0176 


23391 


24362 
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1359 
1361 
1363 
1365 

-1367 

1369 

1371 

1373 

1375 

1377 

1379 

1381 

1383 

1385 

1387 

1389 

1391 

1393 

1395 

1397 

1399 

1401 

1403 

1405 

1407 

1409 

1411 

1413 

1415 

1417 

1419 

1421 

1423 

1425 

1427 

1429 

1431 

1433 

1435 

1437 

1439 

1441 

1443 

1445 

1447 

1449 

1451 

1453 

1455 

1457 

1459 

1461 

1463 

1465 

1467 



1360 
1362 
1364 
1366 

-1-368 

1370 

1372 

1374 

1376 

1378 

1380 

1382 

1384 

1386 

1388 

1390 

1392 

1394 

1396 

1398 

1400 

1402 

1404 

1406 

1408 

1410 

1412 

1414 

1416 

1418 

1420 

1422 

1424 

1426 

1428 

1430 

1432 

1434 

1436 

1438 

1440 

1442 

1444 

1446 

1448 

1450 

1452 

1454 

1456 

1458 

1460 

1462 

1464 

1466 

1468 



RXN02936 
RXN02939 
FRXA01383 
RXN02950 

RXN02951 

RXN02957 

RXN02967 

RXN02971 

RXN02978 

RXN02995 

RXN02997 

RXN03001 

RXN03005 

RXN03009 

RXN03010 

RXN03011 

RXN03012 

RXN03017 

F RXA02753 

RXN03018 

RXN03024 

RXN03025 

RXN03027 

RXN03029 

RXN03031 

RXN03032 

RXN03034 

F RXA00063 

RXN03037 

RXN03041 

F RXA02892 

RXN03045 

RXN03046 

RXN03047 

F RXA00036 

RXN03048 

F RXA00037 

RXN03050 

RXN03053 

F RXA02885 

RXN03055 

F RXA00261 

F RXA02888 

RXN03059 

F RXA02899 

RXN03062 

RXN03066 

F RXA02876 

RXN03067 

RXN03068 

RXN03073 

F RXA02905 

RXN03085 

RXN03089 

F RXA00071 

RYwrW)Q8 



W0197 
W0008 
GR00406 
W0224 

-VVO-176 

W0020 

W0318 

W0210 

W0010 

W0069 

W0069 

W0170 

W0237 

W0238 

W0238 

W0098 

W0241 

W0218 

GR00765 

W0218 

W0003 

W0003 

W0008 

W0009 

W0011 

W0012 

W0013 

GR00010 

W0015 

W0018 

GR10035 

W0019 

W0020 

W0020 

GR00004 

W0020 

GR00004 

W0021 

W0026 

GR10021 

W0026 

GR00039 

GR10024 

W0030 

GR10040 

W0035 

W0038 

GR10016 

W0038 

W0038 

W0042 

GR10044 

W0048 

W0053 

GR00011 

W0064 



24360 
33988 
1147 
7629 

_5739_ 



24557 
32387 
5 

7306 

_513J_ 



30448 

6614 

951 

2191 

348 

3709 

422 

1101 

353 

7435 

2984 

2 

5720 

2630 

7221 

6315 

8668 

17 

95 

1 

3652 
2 

1658 

5364 

1770 

1171 

33044 

1 

25070 

7204 

27423 

9557 

6368 

12 

1 

3519 

11693 

326 

5373 

2125 

525 

7298 

405 

7493 

7648 

1573 

477 

4511 

1183 

4013 

2100 



30158 

6931 

640 

683 

1913 

2981 

874 

334 

6 

7199 
3184 
571 
7258 

138 

8213 

7730 

7796 

151 

607 

789 

3936 

661 

1374 

5549 

1273 

668 

34039 

336 

26485 

8619 

26551 

8685 

7333 

1535 

1536 

3947 

11265 

754 

4894 

1646 

4 

6636 

1067 

7323 

7529 

944 

4 

4161 
26 

5464 
2723 
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1473 

1475 

1477 

1479 

1481 — 

1483 

1485 

1487 

1489 

1491 

1493 

1495 

1497 

1499 

1501 

1503 

1505 

1507 

1509 

1511 

1513 

1515 

1517 

1519 

1521 

1523 

1525 

1527 

1529 

1531 

1533 

1535 

1537 

1539 

1541 

1543 

1545 

1547 

1549 

1551 

1553 

1555 

1557 

1559 

1561 

1563 

1565 

1567 

1569 

1571 

1573 

1575 

1577 

1579 

1581 



1474 

1476 

1478 

1480 

-1482— 
1484 
1486 
1488 
1490 
1492 
1494 
1496 
1498 
1500 
1502 
1504 
1506 
1508 
1510 
1512 
1514 
1516 
1518 
1520 
1522 
1524 
1526 
1528 
1530 
1532 
1534 
1536 
1538 
1540 
1542 
1544 
1546 
1548 
1550 
1552 
1554 
1556 
1558 
1560 
1562 
1564 
1566 
1568 
1570 
1572 
1574 
1576 
1578 
1580 
1582 



RXN03104 

RXN03106 

RXN03107 

RXN03113 

FRXA00506- 

RXN03115 

RXN03122 

RXN03134 

RXN03135 

F RXA02285 

RXN03138 

RXN03140 

RXN03141 

RXN03146 

RXN03147 

RXN03149 

RXN03152 

RXN03153 

RXN03154 

RXN03156 

FRXA00176 

RXN03162 

RXN03167 

F RXA02862 

RXN03170 

F RXA02856 

RXN03172 

F RXA02858 

RXN03173 

F RXA02874 

RXN03174 

F RXA02884 

RXN03177 

F RXA02881 

RXN03182 

RXN03184 

RXN03185 

RXA00003 

RXA00008 

RXA00015 

RXA00018 

RXA00020 

RXA00021 

RXA00025 

RXA00031 

RXA00049 

RXA00052 

RXA00054 

RXA00058 

RXA00059 

RXA00065 

RXA00068 

RXA00079 

RXA00082 

RXA00083 



W0071 
W0074 
W0076 
W0086 
-GR00-126— 
W0089 
W0104 
W0127 
W0127 
GR00660 
W0129 
W0131 
W0135 
W0143 
W0144 
W0146 
W0166 
W0176 
W0179 
W0187 
GR00027 
W0195 
W0327 
GR10006 
W0328 
GR10003 
W0329 
GR10004 
W0330 
GR10015 
W0331 
GR10020 
W0333 
GR10019 
W0339 
W0374 
W0375 
GR00001 
GR00002 
GR00002 
GR00002 
GR00002 
GR00002 
GR00003 
GR00003 
GR00008 
GR00008 
GR00008 
GR00009 
GR00009 
GR00010 
GR00011 
GR00012 
GR00012 
GR00013 

r-» /"v /"\ .d a 



401 

15930 
232 
6541 
_489 

148 

3329 

65312 

66674 

1544 

21194 

4550 

31144 

25998 

2726 

969 

264 

46481 

1328 

4908 

3475 

909 

633 

1695 

457 

459 

1392 

1392 

1340 

1348 

461 

1695 

816 

94 

276 

517 

311 

2279 

606 

5999 

12979 

17142 

18766 

2211 

10383 

2270 

7957 

8557 

7394 

8301 

4140 

1305 

6599 

9019 

771 



577 
16121 
432 
8139 

J.829 

546 

3475 

65662 

67402 

2272 

21664 

4302 

31473 

26468 

2977 

1235 

536 

47044 

2239 

5087 

3317 

1304 

4 

2330 

209 

211 

367 

367 

243 

869 

6 

2156 
151 
759 
4 

20 . 

123 

3019 

115 

6307 

14277 

16363 

20538 

3647 

9982 

2956 

7247 

11469 

6831 

8020 

4412 

724 

6820 

8456 

1070 
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1587 

1589 

1591 

1593 
-1595 — 

1597 

1599 

1601 

1603 

1605 

1607 

1609 

1611 

1613 

1615 

1617 

1619 

1621 

■1623 

1625 

1627 

1629 

1631 

1633 

1635 

1637 

1639 

1641 

1643 

1645 

1647 

1649 

1651 

1653 

1655 

1657 

1659 

1661 

1663 

1665 

1667 

1669 

1671 

1673 

1675 

1677 

1679 

1681 

1683 

1685 

1687 

1689 



1588 

1590 

1592 

1594 
-1596 — 

1598 

1600 

1602 

1604 

1606 

1608 

1610 

1612 

1614 

1616 

1618 

1620 

1622 

1624 

1626 

1628 

1630 

1632 

1634 

1636 

1638 

1640 

1642 

1644 

1646 

1648 

1650 

1652 

1654 

1656 

1658 

1660 

1662 

1664 

1666 

1668 

1670 

1672 

1674 

1676 

1678 

1680 

1682 

1684 

1686 

1688 

1690 
■icon 



RXA00108 
RXA00110 
RXA00117 
RXA00118 
-RXA001-19— 
RXA00121 
RXA00122 
RXA00127 
RXA00134 
RXA00140 
RXA00141 
RXA00142 
RXA00150 
RXA00151 
RXA00153 
RXA00155 
RXA00159 
RXA00161 
RXA00169 
RXA00170 
RXA00173 
RXA00174 
RXA00175 
RXA00179 
RXA00180 
RXA00183 
RXA00185 
RXA00199 
RXA00200 
RXA00207 
RXA00211 
RXA00218 
RXA00220 
RXA00230 
RXA00233 
RXA00234 
RXA00237 
RXA00238 
RXA00239 
RXA00240 
RXA00244 
RXA00245 
RXA00248 
RXA00250 
RXA00252 
RXA00257 
RXA00258 
RXA00260 
RXA00273 
RXA00274 
RXA00275 
RXA00276 

RYAnfi97Q 



GR00015 
GR00016 
GR00019 
GR00019 
-GR000-19— 
GR00019 
GR00019 
GR00020 
GR00021 
GR00022 
GR00022 
GR00022 
GR00023 
GR00023 
GR00023 
GR00023 
GR00024 
GR00024 
GR00026 
GR00026 
GR00027 
GR00027 
GR00027 
GR00028 
GR00028 
GR00028 
GR00028 
GR00031 
GR00031 
GR00032 
GR00032 
GR00032 
GR00032 
GR00034 
GR00036 
GR00036 
GR00036 
GR00036 
GR00036 
GR00036 
GR00037 
GR00037 
GR00037 
GR00038 
GR00038 
GR00039 
GR00039 
GR00039 
GR00042 
GR00042 
GR00042 
GR00042 
GR00043 



546 
364 
791 
918 

J7-04 

3473 
4220 

2871 

1648 

3841 

4307 

4776 

4085 

4956 

7656 

8615 

3868 

4893 

5222 

9914 

1716 

2079 

2732 

1714 

2334 

7344 

9418 

2172 

2837 

6430 

10120 

18104 

20666 

746 

420 

998 

3668 

4186 

5118 

5342 

1565 

3049 

7843 

6 

485 

1760 

3219 

9234 

185 

1556 

4696 

5016 

4001 



4 

912 
201 
1172 

2462 

4183 
5842 
2416 
1079 
3656 
3846 
4300 
4858 
5552 
7231 
9397 
2687 
5354 
3150 
8061 

1384 

1795 

2103 

1256 

1795 

8195 

12045 

754 

2535 

6747 

10782 

19243 

20163 

27 

4 

459 

4045 

4554 

4534 

5133 

930 

1565 

7121 

221 

727 

2215 

3890 

10409 

1297 

4165 

4238 

4675 

2616 
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1701 

1703 

1705 

1707 

1709~ 

1711 

1713 

1715 

1717 

1719 

1721 

1723 

1725 

1727 

1729 

1731 

1733 

1735 

1737 

1739 

1741 

1743 

1745 

1747 

1749 

1751 

1753 

1755 

1757 

1759 

1761 

1763 

1765 

1767 

1769 

1771 

1773 

1775 

1777 

1779 

1781 

1783 

1785 

1787 

1789 

1791 

1793 

1795 

1797 

1799 

1801 

1803 

1805 

1807 

1809 

1811 

1813 



1702 
1704 
1706 
1708 

-mo— 

1712 

1714 

1716 

1718 

1720 

1722 

1724 

1726 

1728 

1730 

1732 

1734 

1736 

1738 

1740 

1742 

1744 

1746 

1748 

1750 

1752 

1754 

1756 

1758 

1760 

1762 

1764 

1766 

1768 

1770 

1772 

1774 

1776 

1778 

1780 

1782 

1784 

1786 

1788 

1790 

1792 

1794 

1796 

1798 

1800 

1802 

1804 

1806 

1808 

1810 

1812 

1814 



RXA00297 

RXA00320 

RXA00321 

RXA00322 

RXA00325— 

RXA00326 

RXA00336 

RXA00337 

RXA00339 

RXA00349 

RXA00355 

RXA00375 

RXA00380 

RXA00387 

RXA00392 

RXA00394 

RXA00395 

RXA00396 

RXA00397 

RXA00398 

RXA00408 

RXA00409 

RXA0041 1 

RXA00423 

RXA00424 

RXA00425 

RXA00428 

RXA00429 

RXA00430 

RXA00433 

RXA00451 

RXA00457 

RXA00462 

RXA00463 

RXA00468 

RXA00469 

RXA00472 

RXA00474 

RXA00475 

RXA00476 

RXA00481 

RXA00486 

RXA00490 

RXA00491 

RXA00493 

RXA00519 

RXA00528 

RXA00529 

RXA00530 

RXA00535 

RXA00540 

RXA00549 

RXA00550 

RXA00553 

RXA00554 

RXA00563 

RXA00564 



GR00048 
GR00057 
GR00057 
GR00057 
-GR00057— 
GR00057 
GR00057 
GR00058 
GR00059 
GR00066 
GR00069 
GR00080 
GR00082 
GR00084 
GR00086 
GR00086 
GR00086 
GR00086 
GR00086 
GR00087 
GR00091 
GR00091 
GR00092 
GR00097 
GR00097 
GR00097 
GR00098 
GR00098 
GR00098 
GR00100 
GR00110 
GR00114 
GR00116 
GR00116 
GR00118 
GR00119 
GR00119 
GR00119 
GR00119 
GR00119 
GR00119 
GR00120 
GR00121 
GR00122 
GR00123 
GR00132 
GR00136 
GR00136 
GR00136 
GR00137 
GR00139 
GR00143 
GR00143 
GR00145 
GR00145 
GR00151 
GR00151 



2861 
358 
2411 
3658 

-8594 

9378 

19461 

530 

817 

3 

635 

549 

836 

1403 

3890 

5322 

5417 

6653 

7206 

1 

642 

1088 

1685 

909 

1379 

1433 

2657 

3063 

3473 

1446 

816 

1451 

3023 

4209 

1282 

1647- 

5449 

6575 

8822 

8961 

17636 

1 

2676 
1057 
3 
4 

3562 

5274 

6837 

5155 

2027 

502 

935 

742 

1606 

1 

3744 



3772 
537 
597 
2555 

_9238 

9857 

19931 

6 

1533 

1061 

510 

49 

216 

591 

3027 

4990 

5716 

6183 

6667 

681 

1088 

2500 

1011 

457 

909 

1657 

2025 

2662 

3063 

1970 

325 

372 

1644 

3388 

464 

472 . 

4589 

8152 

8163 

9821 

18220 

702 

1774 

638 

326 

516 

4650 

4732 

5557 

5871 

2269 

897 

1255 

1062 

1136 

2739 

4148 
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1815 
1817 
1819 
1821 

-1823 — 

1825 

1827 

1829 

1831 

1833 

1835 

1837 

1839 

1841 

1843 

1845 

1847 

1849 

1851 

1853 

1855 

1857 

1859 

1861 

1863 

1865 

1867 

1869 

1871 

1873 

1875 

1877 

1879 

1881 

1883 

1885 

1887 

1889 

1891 

1893 

1895 

1897 

1899 

1901 

1903 

1905 

1907 

1909 

1911 

1913 

1915 

1917 

1919 

1921 

1923 

1925 



1816 

1818 

1820 

1822 

1824 — 

1826 

1828 

1830 

1832 

1834 

1836 

1838 

1840 

1842 

1844 

1846 

1848 

1850 

1852 

1854 

1856 

1858 

1860 

1862 

1864 

1866 

1868 

1870 

1872 

1874 

1876 

1878 

1880 

1882 

1884 

1886 

1888 

1890 

1892 

1894 

1896 

1898 

1900 

1902 

1904 

1906 

1908 

1910 

1912 

1914 

1916 

1918 

1920 

1922 

1924 

1926 



RXA00573 

RXA00576 

RXA00577 

RXA00578 

RXA00582 — 

RXA00585 

RXA00586 

RXA00587 

RXA00595 

RXA00597 

RXA00598 

RXA00601 

RXA00602 

RXA00604 

RXA00610 

RXA00611 

RXA00613 

RXA00614 

RXA00617 

RXA00628 

RXA00631 

RXA00637 

RXA00646 

RXA00649 

RXA00652 

RXA00654 

RXA00656 

RXA00657 

RXA00661 

RXA00667 

RXA00676 

RXA00678 

RXA00691 

RXA00692 

RXA00693 

RXA00701 

RXA00707 

RXA00713 

RXA00714 

RXA00716 

RXA00719 

RXA00724 

RXA00726 

RXA00740 

RXA00741 

RXA00742 

RXA00743 

RXA00745 

RXA00746 

RXA00747 

RXA00748 

RXA00749 

RXA00751 

RXA00752 

RXA00757 

RXA00763 



GR00156 

GR00156 

GR00156 

GR00156 

GR00156 — 

GR00156 

GR00156 

GR00156 

GR00159 

GR00159 

GR00159 

GR00159 

GR00159 

GR00159 

GR00161 

GR00161 

GR00162 

GR00162 

GR00162 

GR00165 

GR00166 

GR00167 

GR00169 

GR00169 

GR00169 

GR00169 

GR00169 

GR00169 

GR00172 

GR00175 

GR00178 

GR00179 

GR00181 

GR00181 

GR00181 

GR00182 

GR00185 

GR00188 

GR00188 

GR00188 

GR00188 

GR00191 

GR00192 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00202 

GR00203 

GR00204 



117 
2916 
2980 
4087 

^9442 

11894 
12818 
13008 
3 

797 

1070 

3459 

4907 

5489 

1193 

3640 

1652 

1680 

4002 

1284 

172 

2002 

446 

2823 

5449 

7213 

9495 

10882 
664 

593 

647 

1037 

2152 

3450 

4303 

427 

377 

71 

1809 

3002 

5283 

811 

841 

1646 

2986 

5517 

6652 

13874 

13755 

15067 

15917 

17240 

20245 

21847 

3119 

1384 



767 
2245 
3327 
3365 

_8924 

11577 

11937 

13490 

332 

1066 

1387 

3749 

4155 

5779 

2056 

2165 

1200 

2594 

5084 

877 

1626 

2754 

6 

3278 

5997 

8478 

9235 

9980 

1353 

1177 

1393 

303 

1223 

2317 

3821 

801 

1348 

1033 

1249 

3514 

6911 

164 

701 
1068 

2054 

3868 

6230 

13341 

14945 

15654 

16360 

16542 

19418 

21419 

4372 

2166 
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1929 


1930 


RXA00781 


1931 


1932 


RXA00788 


1933 


1934 


RXA00804 


1935 


1936 


RXA00805 






— RXAG0808- 


1937 


1938 




1939 


1940 


RXA00812 


1941 


1942 


RXA00814 


1943 


1944 


RXA00815 


1945 


1946 


RXA00816 


1947 


1948 


RXA00826 


1949 


1950 


RXA00830 


1951 


1952 


RXA00853 


1953 


1954 


RXA00861 


1955 


1956 


RXA00862 


1957 


1958 


RXA00874 


1959 


1960 


RXA00876 


1961 


1962 


RXA00881 


1963 


1964 


RXA00882 


1965 


1966 


RXA00883 


1967 


1968 


RXA00887 


1969 


1970 


RXA00889 


1971 


1972 


RXA00893 


1973 


1974 


RXA00895 


1975 


1976 


RXA00904 


1977 


1978 


RXA00908 


1979 


1980 


RXA00916 


1981 


1982 


RXA00926 


1983 


1984 


RXA00930 


1985 


1986 


RXA00932 


1987 


1988 


RXA00933 


1989 


1990 


RXA00940 


1991 


1992 


RXA00949 


1993 


1994 


RXA00969 


1995 


1996 


RXA00973 


1997 


1998 


RXA00978 


1999 


2000 


RXA00986 


2001 


2002 


RXA00987 


2003 


2004 


RXA00988 


2005 


2006 


RXA01005 


2007 


2008 


RXA01007 


2009 


2010 


RXA01008 


2011 


2012 


RXA01011 


2013 


2014 


RXA01017 


2015 


2016 


RXA01021 


2017 


2018 


RXA01029 


2019 


2020 


RXA01031 


2021 


2022 


RXA01032 


2023 


2024 


RXA01033 


2025 


2026 


RXA01034 


2027 


2028 


RXA01035 


2029 


2030 


RXA01036 


2031 


2032 


RXA01037 


2033 


2034 


RXA01038 


2035 


2036 


RXA01039 


2037 


2038 


RXA01040 


2039 


2040 


RXA01041 



GR00206 
GR00209 
GR00215 
GR00215 
-GR002-1-7— 
GR00219 
GR00219 
GR00219 
GR00219 
GR00223 
GR00224 
GR00231 
GR00235 
GR00236 
GR00241 
GR00241 
GR00242 
GR00242 
GR00242 
GR00242 
GR00242 
GR00244 
GR00244 
GR00246 
GR00247 
GR00251 
GR00253 
GR00253 
GR00253 
GR00253 
GR00257 
GR00259 
GR00273 
GR00274 
GR00276 
GR00280 
GR00280 
GR00280 
GR00286 
GR00287 
GR00287 
GR00288 
GR00290 
GR00291 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 
GR00295 



2682 
910 
438 
2057 

-1.029 

287 

2463 

3236 

4382 

567 

266 

3775 

6 

580 

758 

4208 

8057 

8788 

10060 

13544 

15341 

789 

2578 

1457 

1611 

4108 

466 

3841 

5068 

6047 

129 

5400 

1 

2272 

217 

60 

875 

1371 

520 

2572 

2719 

2089 

2175 

1759 

1338 

3182 

3974 

4363 

5177 

5818 

6513 

7000 

7530 

9540 

9711 

10780 
a Ac\aa 



2395 
686 
881 
2938 

_352 

1345 

3236 

3808 

4678 

37 

988 

3173 

431 

17 

1846 

2454 

8434 

9465 

9542 

14266 

15928 

193 

1988 

702 

2168 

518 

104 

3089 

5541 

5586 

524 

6047 

147 

1670 

831 

401 

411 

949 

1365 

866 

4659 

857 

1567 

2280 

1826 

3847 

4348 

4698 

4824 

6423 

6965 

7527 

8276 

8965 

10613 

10932 
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2043 


ZU44 


KAMU IUHO 


GR00295 


12774 


13346 


on>i c 

2045 


ZU4t> 


KAMU IU*+*t 


GR00295 


14024 


15280 


2047 


ZU4o 


RYAmru^ 

KAMU 1 \JHD 


GR00295 


15407 


17230 


2049 


oncn 
ZUDU 


KAMU 1 \JH\J 


GR00295 


17441 


19219 


one a 


onco 


RYAfi1fi47 


GR00295 


19244 


19717 


2051 


ZUoZ 


r\AnU IXJHI 


■■ — \?l\UUtv/w 




8246 


2053 


onc^ 
ZUo4 


f\AnU I UOO 


GR00296 


8566 


once 

2055 


onKC 

ZUDO 


IaAAU IUOO 


GR00297 


828 


499 


2057 


on^Q 


KAMU 1 UOO 


GR00298 


605 


1330 


2059 


oncn 
ZUoU 


KAMU I UOO 


GR00298 


2184 


3254 


2061 


2062


KAMU I U f H 


GR00300 


2811 


2107 


2063 


2064 


DYAnm7P. 
KAMU 1 U / O 


GR00300 


4374 


3355 


2065 


2066


DY AH1 07ft 
KAMU I U / O 


GR00300


6043 


6876 


2067


2068


DY AfM Hft^ 
KAMU IUOO 


GR00302 


1777 


1502 


2069


2070


KAMU1UOO 


ur\Uu jut 


3083 


1902 


2071 


2072


DYAnmoi 
kamu i uy I 


GR00305 


546 


76 


2073 


2074


kamu i uyz 


GR00305 


702 


881 


2075 


2076


kamu 1 uyz 


GR00305 


702 


881 


2077 


2078


DY AO.1 HQR 

kamu i uyo 


GR00306 


4341 


3643 


2079 


2080


nvAni 1 no 
KAMU1 1UZ 




10018 


8774 


2081 


2082


DYAni 1 0^ 
KAMU I I UO 


GR00306 


10316 


10092 


2083 


2084


DY Ani 1 H7 
KAMU II U ( 




13612 


14811 


2085 


2086 


DYAni 1 HP 

kamuttuo 




15562 


14912 


2087 


2088


DYAni 1 no 

KAMunuy 


OiaUUOUO 


16281 


15640 


2089 


2090


DYAni 110 

KAMU1 uy 


Or\UUO I U 


1068 


139 


2091 


2092


DYAni 1 00 

KAMU 11 ZZ 


fiRnn^i 1 


557 


36 


2093 


2094


DYAni 10°. 
KAMU I I ZO 


\J lAUUO 1 1 


1090 


644 


2095 


2096 


DYAni 1 07 

KAMU1 iZ/ 


nRnn^iA 

UiaUUO I H 


2 


280 


2097 


2098


DYAni 1 oo 

KAMUiTZy 


OiaUUO I*t 


1461


3326 


2099 


2100


DYAni 1 Q1 

KAMU i lol 


^Rnn^iR 

OrxUUO 1 u 


445 


1311 


2101 


2102


DYAni 1 Q7 

KAMUIIo/ 


Or\UUO 1 O 


1101


1460 


2103 


2104


DYAni 1 

KAMU1 TOD 




1588 


1388 


2105 


2106


DYAni 1 £8 

KAMU mOo 




2580 


1639 


2107 


2108


DYAni 1 

kamu 1 1 oy 




3089 


2775 


2109


2110


DYAni i£n 

KAMU I I OU 


GR00328 


4187 


3213 


2111


2112


DYAni 1 AO. 
KAMU I IOO 




710 


6 


2113


2114


- - DYAnUfi^ 
KAMU I I DO 


GR00332


2155


1583 


2115


2116


RYAm 1fifi 
KAMU I I DO 


GR00332 


3005 


2523 


2117


2118


DYAni 17n 
KAMU I I / U 


GR00334 


638 


1120 


2119


2120


DYAni 171 
KAMU MM 


GR00334 


1714 


2406 


2121


2122


DYAni 17fi 
KAMU I I / D 


GR00335 


1980 


1477 


2123


2124


RYAm 177 
KAMU II// 


GR00335 


2121 


4106 


2125 


2126


RYAD1 17fl 
KAMU II f O 


GR00335 


4106 


4555 


2127


2128


RYA01 184 

W AMU I I OH 


GR00338 


1489 


17 


2129


2130


RYAm 

r\AMU I I OO 


GR00338 


3742 


2645 


2131 2132 RXA01186 GR00338 3742 2645
2133 2134 RXA01187 GR00338 3850 4308
2135 2136 RXA01195 GR00343 1413 1859
2137 2138 RXA01196 GR00343 1889 2578
2139 2140 RXA01197 GR00343 3333 2881
2141 2142 RXA01198 GR00343 3422 3724
2143 2144 RXA01207 GR00347 126 773
2145 2146 RXA01213 GR00351 1508 282
2147 2148 RXA01218 GR00353 1078 1506

2157
2159 
2161 
2163 
2165
2167 
2169 
2171 
2173 
2175 
2177 
2179 
2181 
2183 
2185 
2187 
2189 
2191 
2193 
2195 
2197 
2199 
2201 
2203 
2205 
2207 
2209 
2211 
2213 
2215 
2217 
2219 
2221 
2223
2225 
2227 
2229 
2231 
2233 
2235 
2237 
2239 
2241 
2243 
2245 
2247 
2249 
2251 
2253 
2255 
2257 
2259 



2158 

2160 

2162 

2164 
2166

2168 

2170 

2172 

2174 

2176 

2178 

2180 

2182 

2184 

2186 

2188 

2190 

2192 

2194 

2196 

2198 

2200 

2202 

2204 

2206 

2208 

2210 

2212 

2214 

2216 

2218 

2220 

2222 

2224 

2226 

2228 

2230 

2232 

2234 

2236 

2238 

2240 

2242 

2244 

2246 

2248 

2250 

2252 

2254 

2256 

2258 

2260 



RXA01271 

RXA01273 

RXA01282 

RXA01294 

RXA01295 

RXA01304 

RXA01310 

RXA01313 

RXA01315 

RXA01316 

RXA01317 

RXA01318 

RXA01330 

RXA01333 

RXA01336 

RXA01342 

RXA01348 

RXA01349 

RXA01357 

RXA01359 

RXA01366 

RXA01367 

RXA01370 

RXA01372 

RXA01378 

RXA01380 

RXA01384 

RXA01396 

RXA01397 

RXA01401 

RXA01402 

RXA01405 

RXA01413 

RXA01414 

RXA01417 

RXA01421 

RXA01425 

RXA01429 

RXA01439 

RXA01440 

RXA01441 

RXA01445 

RXA01447 

RXA01452 

RXA01456 

RXA01457 

RXA01463 

RXA01469 

RXA01470 

RXA01472 

RXA01473 

RXA01474 



GR00367 
GR00367 
GR00369 
GR00373 
GR00373
GR00376 
GR00380 
GR00381 
GR00382 
GR00382 
GR00382 
GR00382 
GR00387 
GR00389 
GR00389 
GR00389 
GR00392 
GR00392 
GR00393 
GR00393 
GR00397 
GR00397 
GR00398 
GR00399 
GR00401 
GR00403 
GR00406 
GR00408 
GR00408 
GR00409 
GR00409 
GR00410 
GR00412 
GR00412 
GR00414 
GR00416 
GR00417 
GR00417 
GR00418 
GR00418 
GR00418 
GR00418 
GR00418 
GR00419 
GR00420
GR00420 
GR00421 
GR00422 
GR00422 
GR00422 
GR00422 
GR00422 



23467 
26475 
5444 
3537 

3764

1982 

803 

1116 

1394 

1855 

2296 

3616 

569 

1231 

3640 

11296 

261 

1531 

4357 

6857 

1369 

1518 

1875 

1 

1281 
2 

3238 

6475 

6894 

3193 

3508 

1844 

854 

1628 

645 

1215 

1701 

5651 

5949 

7496 

8542 

15083 

17885 

2363 

898 

1499 

2493 

2091 

4112 

5783 

6596 

6678 

7651



21656 
25042 
4665 
2872 

4738

2467 

477 

172 

744 

1553 

1877 

2315 

1024 

227 

3038 

12807 

752 

755 

4659 

8038 

980 

1919 

2225 

591 

4 

2017 

1523 

6218 

6475 

3453 

3981 

1389 

1453 

2134 

49 

829 

2585 

6268 

6494 

6489 

7514 

14091 

18733 

2641 

1419 

2173 

1330 

3122 

3687 

5328 

5832 

7223 

7226 



2271

2273 

2275 

2277 

2279

2281 

2283 

2285 

2287 

2289 

2291 

2293 

2295 

2297 

2299 

2301 

2303 

2305 

2307 

2309 

2311 

2313 

2315 

2317 

2319 

2321 

2323 

2325 

2327 

2329 

2331 

2333 

2335 

2337 

2339 

2341

2343 

2345 

2347 

2349 

2351 

2353 

2355 

2357 

2359 

2361 

2363 

2365 

2367 

2369 

2371 



2272 
2274 
2276 
2278 
2280
2282 
2284 
2286 
2288 
2290 
2292 
2294 
2296 
2298 
2300 
2302 
2304 
2306 
2308 
2310 
2312 
2314 
2316 
2318 
2320 
2322 
2324 
2326 
2328 
2330 
2332 
2334 
2336 
2338 
2340
2342
2344 
2346 
2348 
2350 
2352 
2354 
2356 
2358 
2360 
2362 
2364 
2366 
2368 
2370 
2372 



RXA01501 

RXA01504 

RXA01505 

RXA01506 

RXA01507 

RXA01519 

RXA01520 

RXA01523 

RXA01525 

RXA01527 

RXA01536 

RXA01540 

RXA01543 

RXA01544 

RXA01545 

RXA01546 

RXA01547 

RXA01548 

RXA01552 

RXA01554 

RXA01560 

RXA01575 

RXA01577 

RXA01579 

RXA01585 

RXA01586 

RXA01595 

RXA01600 

RXA01602 

RXA01605 

RXA01610 

RXA01611 

RXA01612 

RXA01619 

RXA01622 

RXA01623 

RXA01624 

RXA01628 

RXA01630 

RXA01641 

RXA01642 

RXA01643 

RXA01645 

RXA01646 

RXA01656 

RXA01665 

RXA01671 

RXA01673 

RXA01675 

RXA01676 

RXA01677 



GR00424 
GR00424 
GR00424 
GR00424 
GR00424
GR00424 
GR00424 
GR00424 
GR00424 
GR00425 
GR00427 
GR00428 
GR00430 
GR00430 
GR00430 
GR00430 
GR00430 
GR00430 
GR00431 
GR00432 
GR00435 
GR00438 
GR00438 
GR00439 
GR00441 
GR00441 
GR00447 
GR00447 
GR00447 
GR00448 
GR00449 
GR00449 
GR00449 
GR00451 
GR00452 
GR00452
GR00452 
GR00453 
GR00454 
GR00456 
GR00456 
GR00456 
GR00456 
GR00456 
GR00460 
GR00463 
GR00466 
GR00467 
GR00467 
GR00467 
GR00467 



8130 

10710 

11318 

11815 
12239

23725 

24784 

27951 

32301 

5126 

4066 

3083 

2802 

3496 

4838 

5584 

6371 

7432 

6122 

3719 

767 

8024 

8811 

671 

1226 

1597 

3326 

10460 

13591 

960 

4343 

4832 

5235 

2407 

1908 

2514
3220 
866 
341 
5182 
6557 
8374 
10574 
11513 
1548 
2152 
854 
1807 
2824 
4179 
5043 



7843 
11318 
11815 
12225 
12661
24471 
25167 
28901 
30580 
2616 
2825 
2382 
37 

2897 

3588 

4889 

5709 

6425 

5145 

1578 

438 

7005 

9185 

1054 

600 

1229 

4285 

11128 

12062 

2474 

3615 

4476 

4891 

1433 

2510 

3224

3564 

1879 

1417 

6552 

7798 

7949 

9969 

10695 

2444 

1433 

1468 

773 

3234 

3424 

4300 






GR00489 



2636 3154 



m < 2 386 RXA01729 ^ 109 807 

s s ss = " « 

» "96 RXAO" GR0O495 , 878 3518 



2389 2390 RXA01738 ^ 7535 6738 

2"! 23. wo} gSL-a* <* 



2397 2398 RXA01750 557 6 

2400 RXA01752 GROW 557 

2403 2404 *J GR00501 827 

» 2406 S GR00501 5 34 JJ> 

2407 2408 W 0J 3 4 * 



2409 2410 JJWB ^ 178 741 

2411 2412 S GR00504 838 ?289 

2414 WJ™ GR00504 

2416 SS 6R00504 

2418 ^ 6R00504 

2420 ^ GR00504 

2422 RJA 6 R00504 -■- m 

2423 2424 RXA GR005 04 6384 

2425 2426 R« ^ 68 42 ^078 



2413 2414 ss; «5 23 * s 

2415 2416 5vini778 6R00504 2912 4W 

2417 2418 S GR00504 424 ^ S 

2410 2420 S GR00504 5 

2421 2422 R» RQ0504 6052 6312 



2427 2428 RXA01783 ?29 1304 

2430 RXA01785 GR 801 

2S 2432 S!S SS 375 151 

2433 2434 RW GR00506 1672 ™ 



2^ 2 1f s SS! = 1385 2£ 

2437 243 Kg GR00506 

2439 ^ S 793 GR00506 

2441 2442 RXAO 793 

2444 RXAO 794 377 570 



2443 W» Z^. m GR00509 1573 

2445 2446 RXA 79 22 

W 2448 SS GR00509 6 7 ™ 

2440 2450 WM»J GR0050 9 6515 8 85 

M « 2452 RXA018 » ^ 7 4
2499


2500 


RXA01959 


bKUU0D4 


2501 


2502 


RXA01961 


oKUUODO 


2503 


2504 


RXA01962 


oKUUODO 


2505 


2506 


RXA01963 


vjKUUODO 






RXA01964


VjKUUOOO— 


2507 


2508 






2509 


2510 


RXA01965 


oKUUODO 


2511 


2512 


RXA01966 


oKUUODf 


2513 


2514 


RXA01968 


r*Dfinwi7 
oKUUOOf 


2515 


2516 


RXA01969 


pDnnRR7 
bKUUOD/ 


2517 


2518 


RXA01973 


GKUUOf U 


2519 


2520 


RXA01974 


bKUUO/U 


2521 


2522 


RXA01976 




2523 


2524 


RXA01977 


GKUUD/ i 


2525 


2526 


RXA01982 


GKUUOf o 


2527 


2528 


RXA01990 


GKUUOol 


2529 


2530 


RXA01992 


GKUUooo 


2531 


2532 


RXA01999 


GROOoby 


2533 


2534 


RXA02001 


bKUUoyU 


2535 


2536 


RXA02004 


GR00594 


2537 


2538 


RXA02006 


ODHAC07 

bKUUby/ 


2539 


2540 


RXA02009 


GKUUbU i 


2541 


2542 


RXA02013 


GROOdU/ 


2543 


2544 


RXA02021 


GRUUblo 


2545 


2546 


RXA02036 


i^D AAA A A 


2547 


2548 


RXA02040 


GR00621 


2549 


2550 


RXA02046 


GKUUo^o 


2551 


2552 


RXA02051 


GR00624


2553 


2554 


RXA02053 


GR00624 


2555 


2556 


RXA02057 


GR00625 


2557 


2558 


RXA02058 


GROObzo 


2559 


2560 


RXA02069 


GR00627


2561 


2562 


RXA02070 




2563 


2564 


RXA02080 


GKUUbzo 


2565 


2566 


RXA02081 


GKUUbzo 


2567 


2568 


RXA02084 


GR00629


2569


2570 


RXA02089


GR00629


2571 


2572 


RXA02090 


GR00629


2573 


2574 


RXA02091 


GR00629


2575 


2576 


RXA02097 


GR00630


2577 


2578 


RXA02102 


GR00631


2579 


2580 


RXA02103 




2581 


2582 


RXA02109 


GR00632


2583 


2584 


RXA02117 




2585 


2586 


RXA02123 


oKUUDOD 


2587


2588




GR00636 


2589 


2590 


RXA02125 


GR00637 


2591 


2592 


RXA02129 


GR00637 


2593 


2594 


RXA02132 


GR00638 


2595 


2596 


RXA02137 


GR00639 


2597 


2598 


RXA02141 


GR00639 


2599 


2600 


RXA02146 


GR00639 


2601 


2602 


RXA02152 


GR00640 


2603 


2604 


RXA02163 


GR00640 



1639 
521 
1022 
1757 

1329

1935 

47 

3295 

5689 

2 

658 

3742 

4547 

3001 

1 

709 
2384 
700 
3 

498 

127 

553 

2008 

3441 

1452 

2680 

3186 

5484 

2972 

4051 

1116 

1733 

11017 

12307 

2920 

8431 

9764 

10512 

184 

4479 

4510 

3460 

1056 

6558 

7956 

739 

5906 

737 

4166 

8457 

14742 

237 

10072 

IflQOyl 



2019 
1000 
1591 
2440 

4 

1375 

703 

2138 

5216 

583 

2109 

2222 

3972 

1844 

999 

260 

2854 

152 

209 

4 

5 

5 

1061 

3821 

925 

2943 

3683 

6062 

3502 

3500 

1694 

2830 

10211 

13935 

2576 

8901

8964 

9862 

3555 

3322 

4905 

2540 

1529 

7928 

9911 

1539 

6139 

1375 

3369 

8864 

15368 

638 

10824
2613

2615 

2617 

2619 

2621

2623 

2625 

2627 

2629 

2631 

2633 

2635 

2637 

2639 

2641 

2643 

2645 

2647 

2649 

2651 

2653 

2655 

2657 

2659 

2661 

2663 

2665 

2667 

2669 

2671 

2673 

2675 

2677 

2679 

2681 

2683 

2685 

2687 

2689 

2691 

2693 

2695 

2697 

2699 

2701 

2703 

2705 

2707 

2709 

2711 

2713 

2715 

2717 

2719 

2721 



2614 

2616 

2618 

2620 

2622
2624 
2626 
2628 
2630 
2632 
2634 
2636 
2638 
2640 
2642 
2644 
2646 
2648 
2650 
2652 
2654 
2656 
2658 
2660 
2662 
2664 
2666 
2668 
2670 
2672 
2674 
2676 
2678 
2680 
2682 
2684 
2686 
2688 
2690 
2692 
2694 
2696 
2698 
2700 
2702 
2704 
2706 
2708 
2710 
2712 
2714 
2716 
2718 
2720 
2722 



RXA02170 
RXA02172 
RXA02177 
RXA02178 
RXA02181
RXA02183 
RXA02187 
RXA02199 
RXA02203 
RXA02206 
RXA02211 
RXA02212 
RXA02216 
RXA02217 
RXA02218 
RXA02219 
RXA02221 
RXA02227 
RXA02230 
RXA02231 
RXA02244 
RXA02255 
RXA02266 
RXA02267 
RXA02280 
RXA02286 
RXA02287 
RXA02294 
RXA02295 
RXA02297 
RXA02298 
RXA02304 
RXA02308 
RXA02324 
RXA02325 
RXA02331 
RXA02336 
RXA02347 
RXA02349 
RXA02352 
RXA02356 
RXA02358 
RXA02362 
RXA02374 
RXA02390 
RXA02393 
RXA02395 
RXA02396 
RXA02403 
RXA02412 
RXA02417 
RXA02421 
RXA02425 
RXA02427 
RXA02430 



GR00641


47Q8 
h i yo 


4025 


GR00641


RQ1Q 


6581 


GR00641


12683 


13615 


GR00641


1^698 


14497 




171RR 


17845 


GR00641


— 1- /—I VJU 




GR00641


I OOUO 


19187 


GR00641


9194Q 


23447 


GRU0b4b 


95Q1 


3160 


GR0Uo4o 


74RQ 

/ *+OJ7 


7092 


GKUU040 


QQ27 


10862 


GRU0b4o 




2989 


GR00b4y 




467 


GROOobl 


Z 


307 


GRUObol 




306 


oKUUDO I 




1565 


GROUbOl 


1578 


2963 


GROObbl 


fi790 
D f ZU 


8081 


GROObbo 


I ZOO 


1853 


GROUboo 


** I 3D 


3620 


GROObbo 


51 1 1 


4356 


GR00bb4 


I ZUDO 


13590 


GR00654 


99507 

ZZ3U r 


23442 


GROObbo 


£5^ 
OOO 


1165 


GROObbo 


905*3 


1181 


GROOooo 


z 


754 


GROObbU 


^985 

OZOD 


3833 


GROObbU 




4622 


GROOooz 


5QQ9 


5618 


GROUbbz 


fi849 
OO**^ 


6063 


GRUUboZ 


7509 


8638 


GR00dd2 


10^10 


8652 


GRUUbbo 


1R1^ 

ID l O 


723 


GR00bb4 


Q^Q 


511 


GROObbo 


I D*fO 


2633 


GROObbo 


*fO l*T 


3445 


GROObM 


OiJO 


761 


GROOb/z 


97^1 


2552 


GKUUo/ / 


50Q 


189 


GROUb/ o 


^Q4 


5 


GKUUbol 




556 


GR00bo4 


7R1 


1756 


GKUUOOD 


1239 


1529 


oKUUDOO 


7045 


10743 


oKUUOOO 


1626 


2246 


f^poo£Q5 
oKUUDyo 


1500 


832 


GKUUoy / 


168 

I WW 


449 




2 


733 


GR00698 


1309 


1031 


GR00700 


896 


1660 


GR00703 


2043 


2522 


GR00705 


4755 


2632 


GR00705 


7237 


6428 


GR00707 


1 


630 


GR00707 


3447 


3061 


GR00707 


7498 


7683
2727

2729 

2731 

2733 

2735 

2737 

2739 

2741 

2743 

2745 

2747 

2749 

2751 

2753 

2755 

2757 

2759 

2761 

2763 

2765 

2767 

2769 

2771 

2773 

2775 

2777 

2779 

2781 

2783 

2785 

2787 

2789 

2791 

2793 

2795 

2797 

2799 

2801 

2803 

2805 

2807 

2809 

2811 

2813 

2815 

2817 

2819 

2821 

2823 

2825 

2827 

2829 

2831 

2833 

2835 

2837 

2839 



2728 

2730 

2732 

2734 

2736

2738 

2740 

2742 

2744 

2746 

2748 

2750 

2752 

2754 

2756 

2758 

2760 

2762 

2764 

2766 

2768 

2770 

2772 

2774 

2776 

2778 

2780 

2782 

2784 

2786 

2788 

2790 

2792 

2794 

2796 

2798 

2800 

2802 

2804 

2806 

2808 

2810 

2812 

2814 

2816 

2818 

2820 

2822 

2824 

2826 

2828 

2830 

2832 

2834 

2836 

2838 

2840 



RXA02443 

RXA02444 

RXA02452 

RXA02459 

RXA02461

RXA02467 

RXA02472 

RXA02473 

RXA02475 

RXA02478 

RXA02482 

RXA02483 

RXA02484 

RXA02486 

RXA02488 

RXA02489 

RXA02495 

RXA02496 

RXA02498 

RXA02500 

RXA02506 

RXA02514 

RXA02518 

RXA02521 

RXA02524 

RXA02525 

RXA02540 

RXA02544 

RXA02545 

RXA02546 

RXA02549 

RXA02552 

RXA02554 

RXA02569 

RXA02570 

RXA02573 

RXA02575 

RXA02576 

RXA02577 

RXA02584 

RXA02585 

RXA02588 

RXA02591 

RXA02598 

RXA02600 

RXA02601 

RXA02602 

RXA02604 

RXA02609 

RXA02617 

RXA02619 

RXA02620 

RXA02639 

RXA02647 

RXA02649 

RXA02652 

RXA02655 



GR00709 
GR00709 
GR00710 
GR00712 
GR00712
GR00714 
GR00715 
GR00715 
GR00715 
GR00716 
GR00718 
GR00718 
GR00718 
GR00718 
GR00719 
GR00719 
GR00720 
GR00720 
GR00720 
GR00720 
GR00720 
GR00723 
GR00723 
GR00724 
GR00725 
GR00725 
GR00726 
GR00726 
GR00726 
GR00726 
GR00728 
GR00730 
GR00731 
GR00736 
GR00736 
GR00739
GR00739 
GR00740 
GR00740 
GR00741 
GR00741 
GR00741 
GR00741 
GR00742 
GR00742 
GR00742 
GR00742 
GR00742 
GR00742 
GR00745 
GR00746 
GR00746 
GR00749 
GR00751 
GR00752 
GR00752 
GR00752 



6818 
7836 
5271 
4341 

6252

643 

5435 

6664 

9595 

1245 

914 

1813 

2317 

3441 

1 

373 

9002 

10025 

11016 

13460 

19484 

1 

3464 

2924 

2405 

3113 

12438 

16715 

18749 

19927 

1331 

924 

1050 

82 

837 

594 

1907 

1569 

2463 

8925 

9917 

13037 

15780 

2576 

5027 

5258 

7239 

8800 

16197 

1404 

204 

1192 

511 

4155 

1284 

2973 

9313 



7771